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Introduction 



This thesis reflects my interest in the history of mathematics, and in areas of math- 
ematics which combine geometry and algebra. The historical and mathematical 
sections of this work can be read independently. One of the main themes of the 
history, though, is how in the nineteenth century a new understanding of mathe- 
matical space developed, in which groups became vital to geometry. The idea of 
using algebraic concepts to investigate geometric structures is present throughout 
the work. 

Chapter 1 is a short history of non-Euclidean geometry. This chapter is a syn- 
thesis of my readings of mainly secondary sources. Historians of mathematics have 
traditionally adopted an 'internalist' approach, explaining changes to the content 
of mathematics only in terms of factors such as the imperative to generalise, or 
the desire to remove inconsistencies. An internalist narrative is certainly necessary 
for a satisfactory history of mathematics, but it is not sufficient. First, the history 
of mathematics is part of broader intellectual history. Euclidean geometry held 
great philosophical prestige, and non- Euclidean geometry challenged fundamental 
assumptions about the nature of space and the truth value of mathematics. Second, 
there are social factors in the history of mathematics. Mathematicians themselves 
are part of a community, the mathematical community. The structure of this com- 
munity in the nineteenth century helps to explain the reception and dissemination 
of non-Euclidean geometry. The sources consulted for this chapter each adopted 
one or more of the approaches to the history of mathematics outlined here, but 
none of them seemed to discuss all the relevant issues. 

Chapter 2 begins the strictly mathematical part of this thesis. One aim of this 
chapter is to provide a deeper understanding of some of the mathematics discussed 
in Chapter 1. To this end, each of the main models of hyperbolic geometry is pre- 
sented, and important results for each model proved. The second aim of Chapter 2 
is to describe thoroughly the way in which the action of the group PSL{2, Z) in- 
duces a tesselation of the upper half-plane. Most of the material in Chapter 2 is 
selected and adapted from Chapters 3-6 of Ratcliffe [21] . 

Chapter 3 poses a question about the upper half-plane, and then provides the 
theory needed to frame the question precisely and answer it in a variety of settings. 
Suppose z is a point in the upper half-plane, and •yz the result of the action of 
an element 7 G P5'L(2,Z) on the point z. The geodesic segment joining z and 
'jz has a finite length, and crosses a finite number of tiles in the tesselation of the 
upper half-plane. The question is whether or not there is any relationship between 
the length of the geodesic segment, and the number of tiles it crosses. We begin by 
defining a symmetric space. Then, if X is a symmetric space, and F a discrete group 
of isometrics of X, we define two distance functions on the group F. The first is the 
geometric distance function, and is induced by the distance function on the space 
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X. The second distance function on T is the word distance function. The word 
distance function is induced by the generators of F, and corresponds to the number 
of tiles in the tesselation crossed by a geodesic segment. We now ask whether or not 
these two distance functions are equivalent. We prove that, when the tiles of the 
induced tesselation are compact, the two distance functions are equivalent. Then, 
in order to generalise the action of PSL{2, Z) on the upper half-plane, we describe 
a symmetric space of matrices on which the group PSL{n, Z) acts. 

Chapter 4 is devoted to the proof of Theorem I4.1.1[ This theorem states that, 
when n = 2, the geometric and word distance functions on PSL{n, Z) are not 
equivalent, but that for all n > 3, these two distances on PSL{n, Z) are equivalent. 

The content of Chapter 3 and Chapter 4 is based upon two papers by Lubotzky, 
Mozes and Raghunathan, [13j and [14j. These papers are not easy to read. There is 
sometimes ambiguity about the hypotheses on the space X and the discrete group F 
acting on X. We have, here, assumed X to be a symmetric space. This assumption 
ensures that geometric distance functions and word distance functions exist, and 
that structures such as geodesies and fundamental domains behave nicely. The 
authors do not prove the 'well-known' result that the geometric and word distance 
functions are well-defined up to Lipschitz equivalence, so we fill this gap. They do 
not discuss the interesting geometric interpretation of the two distance functions, in 
terms of a geodesic crossing tiles in the space. Also, these papers inaccurately refer 
to what we have called the geometric distance function as a Riemannian metric. 
The more recent article, [H] , was used only for the proof of Theorem 13.4.11 By 
assuming that X is a symmetric space rather than a 'coarse path metric space', we 
have simplified the proof a httle, as we may choose the sequence of points {pi}™ q 
lie on the geodesic. Besides, it is not obvious that a 'coarse path metric space' even 
contains a point with a trivial stabiliser subgroup in F. The proofs in Chapter 4 
are based on those in the earlier paper, [13], which concerns Lipschitz equivalence 
for the group PS'L((i, Z). This paper does not describe the symmetric space on 
which PSL{d, Z) acts, so we have added this material. Also, we have, throughout 
Chapter 4, rewritten statements of lemmas, propositions and so on, in order to 
clarify what each of the many constants depends upon. In the paper, for the case 
d = 2, the claim that the word length of grows linearly in n is not proved. For 
the cases d > 3, there is no explicit statement of Theorem 14.4.11 and no indication 
of how to combine the results of Corollary 14.4.101 and Corollary 14.4.141 to prove 
the result of Theorem 14.4.11 In the proof of Proposition I4.4.4[ results I4.4.51fl^^^ 
are stated without proof. In Proposition 14.4. 10| the construction suggested by the 
authors does not actually satisfy the statement of the proposition, and there is no 
proof of the claims of Lemmas 14.4.111 and 14.4.121 Finally, the way in which the 
result of Theorem 14.4.11 implies that the word distance function is bounded above 
by the geometric distance function is not proved. 
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Chapter 1 

A History of Non- Euclidean Geometry 



/ entreat you, leave the doctrine of parallel lines alone; you should fear it 
like a sensual passion; it will deprive you of health, leisure and peace — it 
will destroy all joy in your life. 

from a letter by Farkas Bolyai to his son Janosll] 

/ have now resolved to publish a work on parallels . . . the goal is not yet 
reached, but I have made such wonderful discoveries that I have been 
almost overwhelmed by them . . . I have created a new universe from 
nothing. 

from a letter by Janos Bolyai to his father FarkasH 

1.1 Introduction 

This chapter is a short history of non-Euclidean geometry. The intellectual history 
of developments internal to mathematics is presented, together with discussion of 
relevant social and philosophical trends within the mathematical community and 
in wider contexts. Chapter 2 will provide a rigorous and modern treatment of 
many of the mathematical terms and results referred to here. In order to properly 
acknowledge sources, the referencing style adopted for Chapter 1 is different from 
that in the rest of this thesis. 

The discovery and acceptance of non-Euclidean geometry forms a crucial strand 
of the history of mathematics. For thousands of years, Euclidean geometry held 
great prestige. However, there were small doubts about the parallel axiom. After 
numerous attempts to validate this axiom, in the nineteenth century there was a 
shift to advancing alternative geometries. The coming-together of developments in 
many areas of mathematics aided the acceptance of non-Euclidean geometry by the 
mathematical community. In this process mathematicians were forced to reconsider 
and alter their ideas about the nature of mathematics and its relationship with the 
real world. 

1.2 Euclid's Elements 

Although Egyptian and Babylonian mathematicians had posed and solvedgeomet- 
ric problems, geometry only acquired logical structure with the Greeksjj Greek 

^Quoted in B. A. Rosenfeld. A History of Non-Euclidean Geometry: Evolution of the Concept 
of a Geometric Space. Springer- Verlag, 1988, p. 108. 

^Quoted in Jeremy Gray. Ideas of Space: Euclidean, Non- Euclidean and Relativistic. Claren- 
don Press, 1989, p. 107. 

^Carl B. Boyer. A History of Mathematics. John Wiley & Sons, Inc., 1991, p. 47. 
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geometers were the first to write proofs as deductive sequences of statementso Eu- 
clid's thirteen- volume Stoichia (Elements), which appeared in about 300 B.C.E., 
was an exposition of the fundamentals of classical Greek mathematics. The first six 
books covered plane geometry, and the last three solid geometry. 

The Elements opens with a list of 23 definitions, followed by ten axioms!^ Propo- 
sitions and their proofs fill the remainder of the work. This logical structure re- 
fiected the efforts by mathematicians of the classical period to establish geometry 
as a deductive system dependent on as few assumptions as possible|§ 
Euclid's fifth axiom, which is also known as the parallel axiom, states: 
If a straight line falling on two straight lines makes the interior angles 
on the same side less than two right angles, the two straight lines, if 
produced indefinitely, meet on that side on which the angles are less 
than the two right angleslll 
To understand this, suppose that the two straight lines are / and m, and that the 
transversal falling on them is line t. The axiom is saying that if the interior angles 
on the same side of t sum to less than vr, the three lines /, m and t form a triangle. 
This triangle has two vertices on the line t. Its third vertex lies in the half-plane 
bounded by the line t, and on the same side of t as the interior angles which sum 
to less than tt. 

t 

I 

m 

The parallel axiom does not say anything about the possibility that the pairs of 
interior angles on both sides of the transversal sum to vr. With the word parallel 
defined as meaning 'never intersecting', Euclid proved, without using the fifth ax- 
iom, that if a transversal falling on two straight lines makes the interior angles on 
the same side add to vr, then the two lines are parallel. To prove the converse of 
this statement, Euclid did need to use the parallel axiom. The parallel axiom was 
also essential to the proof of the key theorem that the angle sum of a triangle is vr. 

The Elements made little use of the idea of motion in definitions or methodsjfl 
In the sixth century B.C.E., Pythagoreans had defined a line as the trace of a moving 
point, and a surface as the trace of a moving line. But Euclid largely conformed to 
the views of Aristotle (384-322 B.C.E.), who rejected motion in geometry because 
he considered mathematical objects to be abstractions of physical objects. So, in 
the Elements, a line is a "breadthless length" rather than the successive positions 
of a moving point Euclid did use superposition of geometric figures in the proofs 

^Roberto Torretti. Philosophy of Geometry from Riemann to Poincare. D. Reidel Publishing 
Company, 1984, p. 2. 

^The ten axioms are sometimes called five postulates and five common notions; surviving 
manuscripts of the Elements are not consistent. See Boyer, History of Mathematics, p. 105. 
^Torretti, Philosophy of Geometry, p. 5. 
^Quoted in Boyer, History of Mathematics, p. 106. 
^Rosenfeld, History of Non-Euclidean Geometry, pp. 110-112. 
^Gray, Ideas of Space, p. 28. 




2 



of Propositions 4 and 8, but chose to prove Proposition 26, where he could also 
have used superposition, with a longer method instead. This suggests that he was 
unhappy with the method of superposition!^ 

Euclidean geometry contained many ambiguities and hidden assumptions. The 
opening definitions, such as "a surface is that which has length and breadth only" , 
served no logical purpose, since Euclid provided no prior set of undefined concepts 
in terms of which definitions could be framed0 Euclid's second and third axioms, 

2. [It is possible] to produce a finite straight line continuously in a 
straight line, 

3. [It is possible] to describe a circle with any centre and radius0 
embody the assumption that geometric space is infinite. Greek geometers also 
assumed that space was homogeneous, so that any construction could be performed 
at any point with the same resultso The congruence of figures depends on this 
property of space. The existence of parallel lines was also taken for granted!^ 

Despite these logical weaknesses, the Greeks and many later thinkers considered 
the Elements to be the model for philosophical inquiry. After the fall of Rome, the 
survival and development of Greek mathematics, including Euclidean geometry, 
depended on Arabic-speaking mathematicians in Islamic Africa and Asia. In the 
twelfth century, Arabic translations of the Elements were translated in turn into 
Latin. The Elements were first printed in 1482, and versions in various European 
vernaculars appeared in the sixteenth century. Euclid's Elements achieved huge 
influence in Europe, with more than one thousand editions published]^ Until about 
1800, Euclidean geometry enjoyed secure philosophical and mathematical status. 

1.3 Doubts about the parallel axiom 

From early on there were, though, questions raised about the parallel axiom. A 
lot of basic geometry, such as the angle sum of a triangle, depended on it, so its 
soundness was very important. Yet the parallel axiom appeared much less self- 
evident than the others. Euclid's formulation of the axiom seemed long-winded and 
complicated, and it was not clear that assuming the existence of infinite straight 
lines was reasonable^ Mathematicians did not seriously doubt the truthfulness of 
the axiom until the nineteenth century; they just wanted to make sure that it was 
formally unassailable. Proclus (410-485 C.E.), for example, wrote that the parallel 
axiom 

. . . ought to be struck from the axioms altogether. For it is a theorem 
— one that invites many questions!^ 



^°Morris Kline. Mathematical Thought from Ancient to Modern Times. Oxford University 
Press, 1972, p. 60. 



^^Boyer, History of Mathematics, p. 105. 

Quoted in Kline, Mathematical Thought, p. 59. 
^•^Gray, Ideas of Space, p. 26. 
"Ibid, p. 29. 

"'^^Boyer, History of Mathematics, p. 119. 
^^Kline, Mathematical Thought, p. 863. 



^^Proclus. A Commentary on the First Book of Euclid's Elements. Princeton University Press, 
1969, p. 150. 
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Many mathematicians tried to settle uncertainty about the parallel axiom by 
proving it from the other nine axioms. They were all unsuccessful, as they relied 
on implicit and unsupported assumptions, some of them actually equivalent to the 
parallel axiom. In his 'proof, Ptolemy (d. 168 C.E.) made tacit assumptions about 
parallel lines. Proclus found the flaws in Ptolemy's work, but then based his own 
efforts on a rather dubious Aristotelian axiom about the finiteness of the universe, 
which was inconsistent with Euclidean assumptions of the infinitude of space, ^ 
Thabit ibn Qurra (836-901) wrote two treatises trying to prove the parallel axiomj^ 
In the first, he assumed that if two lines diverge on one side of a transversal, they 
converge on the other side. In the second, he used kinematic arguments, asserting 
that 

If any solid is imagined to move as a whole in one direction with one 
simple and straight movement, then every point in it will have a straight 
movement and will thus draw a straight line on which it will passo 
Thus, the curve at a fixed distance from a straight line is itself straight. Ibn al- 
Haytham (965-1041) gave a false proof of this last statement, then used it to prove 
the parallel axiom. The first substantial European effort on parallels was that of 
John Wallis (1616-1703), who realised that many of Euclid's proofs depended on 
unstated assumptions]^^! He derived the parallel axiom by assuming the existence 
of triangles similar to a given triangle. Other European attempts included those of 
Giordano Vitale (1633-1711), and A. M. Legendre (1752-1833). 

A different approach was to attempt to replace or reformulate the parallel axiom. 
Ptolemy, Proclus, and the Arabian editor of Euclid, Nazir-Eddin (1201-1274), as 
well as the Europeans Wallis, Joseph Fenn, John Playfair (1748-1819) and Legendre, 
all attempted substitute axioms. The version generally given in modern textbooks 
is due to Playfair: "Through a given point P not on a line / there exists only one 
line in the plane of P and / which does not meet F @ All these efforts, though, were 
unsatisfactory, relying on assertions about infinity which were no more self-evident 
than those in Euclid's version 1^ 

A third strategy was to investigate the consequences of denying the parallel 
axiom. Aristotle connected negating the axiom with triangles having an angle sum 
greater than 7r@ The resulting geometry is actually spherical geometry. From a 
modern point of view, there are no parallels in spherical geometry, because all 'lines' 
(great circles) intersect. However, despite its antiquity, and its many applications 
in navigation, spherical geometry was never perceived as a challenge to Euclidean 
geometryj^ This was probably because spherical geometry was not considered to be 
a geometry of the plane@ For example, in the first work in the area, the Sphaerica 



^^Kline, Mathematical Thought, pp. 863-864. 

^^Rosenfeld, History of Non-Euclidean Geometry, pp. 49-56. 

^OQuoted in ibid., p. 52. 

^^Gray, Ideas of Space, p. 57. 

^^Quoted in Kline, Mathematical Thought, p. 865. 

^•^Klinc, Mathematical Thought, p. 866. 

^^Gray, Ideas of Space, pp. 33-34. 

^^Rosenfeld, History of Non-Euclidean Geometry, pp. 1-33. 
^^Gray, Ideas of Space, p. 71. 
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of Theodosius (c. 20 B.C.E.), propositions were formulated mostly in terms of planes 
intersecting the sphere, rather than in terms of the sphere's intrinsic geometry. 

Many mathematicians who considered negating the parallel axiom found that 
their conclusions were counter-intuitive. Common sense and everyday experience 
favoured Euclidean geometry. Some abandoned their efforts. For instance, Omar 
Khayyam (10487-1122) entertained the possibility of a quadrilateral with angle sum 
different from 27r, but straight away dismissed the idea as inconsistent. 

Girolamo Saccheri (1667-1733), though, investigated in some depth the geome- 
try which resulted from denying the parallel axiom. He was the first to try to prove 
that a contradiction would follow from supposing the parallel axiom false. Saccheri 
began by establishing that ZACD = ZBDC in the following quadrilateral. 




He then considered three hypotheses about angles ACD and BDC: they could 
both be obtuse, or both right angles, or both acute@ He showed that these three 
hypotheses implied that the angle sum of a triangle was respectively more than, 
equal to, or less than tt. Implicitly assuming that straight lines could be extended 
indefinitely, and so ruling out spherical geometry, Saccheri then rejected the obtuse 
angle hypothesis, by showing that it implied the parallel axiom. This was a contra- 
diction, since the parallel axiom implies that ACD and BDC are right angles. He 
then considered the acute angle hypothesis. Assuming that it held, Saccheri proved 
several theorems, and found that two straight lines might diverge in one direction 
and come asymptotically close together in the other, so that they had a common 
point at infinity. Although he found this idea repugnant, he only rejected the acute 
angle hypothesis after deriving a contradiction from an error he made in computing 
arc length@ 

Because there had been no rigorous invalidation of the results of negating the 
parallel axiom, a few mathematicians began to speculate that the alternative ge- 
ometry might hold somewhere outside everyday experience. 

J. H. Lambert (1728-1777) considered the same hypotheses as Saccheri, and like 
Saccheri obtained from the obtuse angle hypothesis a contradiction. In Euclidean 
geometry, he observed, there is an absolute measure of angles, but length varies 
according to the unit of length selected. Under the acute angle hypothesis Lambert 
found he could actually define an absolute unit of length, by associating one line 
segment with one angle. He also saw that the area of a triangle was proportional to 
its angle sum. Counter-intuitive though such consequences seemed, Lambert was 
unable to reject the acute angle hypothesis. He supposed that it might hold on 
some "imaginary sphere" , a concept he never explained clearly@ 

Ferdinand Karl Schweikart (1780-1859) and his nephew Franz Adolf Taurinus 
(1794-1874) were influenced by Saccheri and Lambert's work. Assuming the angle 
sum of a triangle was not two right angles, they obtained what they called "astral 



^^Rosenfeld, History of N on- Euclidean Geometry, pp. 98-99. 
28lbid., p. 99. 
29lbid., p. 101. 
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geometry", since it might occur in the region of the stars. Schweikart and Tau- 
rinus were the first to use analysis in non-Euchdean geometry, and this enabled 
progress past the classical approach of Saccheri and Lambert Ifj Their analytic re- 
sults included trigonometric formulae, and the area of a triangle with a vertex at 
infinity, in terms of a constant K. They derived their trigonometric formulae from 
those of spherical trigonometry, by replacing ordinary trigonometric functions with 
hyperbolic trigonometric functions. The radius of the sphere concerned was K. 

Carl Friedrich GauB (1777-1855) did a little work on non-Euclidean geometry 
at the same time as Schweikart and Taurinusl^ He never published anything in 
the area. According to his letters and rough notes, GauB realised that rejecting the 
parallel axiom implied the existence of an absolute measure of length, and doubted 
whether Euclidean geometry could be proved. 

1.4 Systems of non-Euclidean geometry 

Nikolai Ivanovich Lobachevsky (1792-1856) and Janos Bolyai (1802-1860) were 
the first people to publish, without any reservations, research on a system of non- 
Euclidean geometry!^ They worked independently!^ Lobachevsky initially set out 
his ideas on what he called "imaginary geometry" in a presentation at the University 
of Kazan in 1826. (The city of Kazan lies on the Volga River in central Russia.) 
Lobachevsky published papers in Kazan journals in 1829 and 1835-37, and in 1837 
a third paper appeared in Journal fiir Mathematik. Lobachevsky also published 
books on his theories, in 1840 and 1855. As for Janos Bolyai, until 1821 he agreed 
with his father Parkas that the parallel axiom must be true. Over the next two 
years, though, Janos investigated the alternative geometry, arriving at many of the 
same results as Lobachevsky. In 1832 Janos published his work on what he called 
"absolute geometry" , in an appendix to his father's book on the theory of parallels. 

In their systems, Lobachevsky and Bolyai rejected the parallel axiom but re- 
garded all Euclidean propositions proved without using this axiom as still being 
valid0 In Lobachevsky 's account if C is a point at a perpendicular distance a 
from the line AB, there exists an angle 7r(a) such that all lines through C which 
make an angle less than 7r(a) with the perpendicular CD will intersect AB; all other 
lines through C do not intersect AB. In the Euclidean sense of the word parallel, 
there are infinitely many parallels through C. 



Other results of Lobachevksky and Bolyai included trigonometry, arc length ds, the 
area of a circle, and theorems on the area of plane regions and volumes of solids. 

■^°Gray, Ideas of Space, p. 97. 

■^^Rosenfeld, History of Non-Euclidean Geometry, pp. 214-215. 
^^Torretti, Philosophy of Geometry, p. 53. 

^•^For this reason I will refer to their geometry, in which the angle sum of a triangle is less 
than TT, as hyperbolic geometry rather than using one or both of their names. 
^''Torretti, Philosophy of Geometry, p. 40. 
^^discusscd in Kline, Mathematical Thought, pp. 874-875. 
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The founders of non-Euclidean geometry were not formalists, and did not con- 
sider their work to be an investigation of an axiomatic system with only one dif- 
ference from that of Euclidjf^ This is shown by their willingness to consider the 
possibility that their geometry was the geometry of real physical space. When 
Lobachevsky wrote 'straight line', he meant straight in the ordinary sense of the 
word, showing that he regarded his geometry as (at least potentially) realjfll Indeed, 
the discovery of these different geometries made it seem possible that Euclidean ge- 
ometry might be proved inexact by experiment, at some point @ Bolyai commented 
that if non-Euclidean geometry were real, a measurement would determine the size 
of the constant the radius of the imaginary sphere. Since Euclidean geometry 
had been used so successfully in physics, if non-Euclidean geometry were indeed 
the geometry of the real world, then any differences would only be detectable on 
a very large scale@ Lobachevsky thus suggested using the parallax of stars to 
decide which geometry held. His measurements were inconclusive. According to 
some sources, GauB measured distances between mountain peaks for the purpose 
of deciding between geometries^ It seems more likely that GauB realised that the 
distances involved were too small, and he actually made these measurements for his 
research in geodesy0 This is certainly suggested by his writing about the constant 
K: "in the light of our astronomical experience, the constant must be enormously 
larger than the radius of the earth" @ 

1.5 Reception of non-Euclidean geometry 

Until the 1860s, the European mathematical community did not regard investi- 
gations into non-Euclidean geometry as respectable, and non-Euclidean geometry 
was separate from the rest of mathematics. Only about 1870 was the work of 
Lobachevsky and Bolyai recognised. There were social reasons for this reluctant 
reception. As well, there were mathematical issues which had to be resolved, and a 
system of alternative geometry raised many philosophical controversies. 

1.5.1 Social factors 

During the nineteenth century mathematics became professionalised, and impor- 
tant mathematical communities developed at centres including Paris, Berlin and 
Gottingen. Bolyai, as a Hungarian army officer, was socially and geographically a 
long way from these mathematical hubs. Lobachevsky's 1829 paper was published 
in a very obscure journal, got bad reviews and was regarded as incomprehensible. 
His article in the mainstream Journal fiir Mathematik was virtually impossible to 
read, and depended on results in the 1829 paper. GauB, at Gottingen, approved 
of Lobachevsky's 1829 paper and Bolyai's 1832 article, but only praised them in 
letters to friends. 

Because of GauB's prestige, when his correspondence was published in the early 
1860s, these comments made a strong impression on the European mathematical 

^^Torretti, Philosophy of Geometry, p. 61. 
37lbid., pp. 62-63. 
38lbid., p. 254. 
39lbid., p. 63. 

^°For example, Kline, Mathematical Thought, pp. 872-873. 
''^Gray, Ideas of Space, p. 123. 

''^Quoted in Torretti, Philosophy of Geometry, p. 63. 
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community, and led to intensive discussion of new geometrical ideas. In 1865, Arthur 
Cayley (1821-1895) published a commentary on Lobachevsky, which did much to 
disseminate his work. In the years 1866 and 1867, French, German, Italian and 
Russian translations and publications of Lobachevsky's work appeared, so that by 
1870 Lobachevsky and Bolyai's research was known to geometers all across Europe. 
From the 1890s, there were university courses and textbooks on non-Euclidean 
geometry, showing that it was well-accepted by the mathematical community. 

1.5.2 Mathematical developments 

Internal developments in mathematics were needed before non-Euclidean geometry 
could be recognised, because in the 1830s there were sound mathematical reasons 
for holding reservations about the new geometry. Neither Lobachevsky nor Bolyai 
proved that their geometry was consistent, that is, that it did not contain any con- 
tradictions. Lobachevsky was convinced that he had established consistency, since 
he had obtained his trigonometric formulae from those of spherical trigonometry by 
multiplying the sides of the triangle by the complex number i. His argument was 
in fact insufficient. It only showed that his trigonometric formulae followed from 
the assumptions he had made in setting up hyperbolic geometry^ 

Models of hyperbolic geometry were critical to overcoming these mathemati- 
cal reservations. The models found by the mathematicians Beltrami, Klein and 
Poincare were structured collections of objects satisfying a set of mathematical 
statements, given a suitable, if unusual, interpretation of the key terms such as 
'straight line'@ The (relative) consistency of non- Euclidean geometry was proved 
using a model, and models aided the acceptance of the new geometry in other ways. 
They provided a method of visualising concepts contrary to intuition, such as the 
existence of infinitely many parallels through a point. Models also made it possible 
to apply non- Euclidean geometry to solving problems in other areas of mathematics. 
This demonstrated its usefulness. 

In the 1830s, Ferdinand Minding (1806-85) had investigated surfaces of con- 
stant negative curvature, such as the pseudosphere, which is obtained by rotating 
a tractrix about a vertical axis. For such surfaces. Minding obtained the same 
trigonometric formulae as in hyperbolic geometry, and published them in 1840. 

Minding's work was largely forgotten until Eugenio Beltrami (1835-1900) no- 
ticed that Minding's trigonometric formulae were identical to Lobachevsky's. He 
thus realised that non-Euclidean plane geometry could be regarded as the geom- 
etry of a surface of constant negative curvature. In his model of 1868, Beltrami 
identified the hyperbolic plane with the interior of a fixed circle in the Euclidean 
plane. Hyperbolic straight lines were identified with open chords of this circle, 
and points at infinity with the circle itself. By constructing this first model of the 
hyperbolic plane, Beltrami established the relative consistency of non-Euclidean 
geometry. This was because his model was defined wholly in terms of Euclidean 
geometry, and he proved, using differential geometry, that it satisfied all the ax- 
ioms of hyperbolic geometry. Thus, the new geometry was consistent if Euclidean 
geometry was consistent!^ 



^■^Rosenfeld, History of Non- Euclidean Geometry, p. 227. 
^^Torretti, Philosophy of Geometry, p. 132. 
John G. RatclifFe. Foundations of Hyperbolic Manifolds. Springer- Verlag, 1994, p. 7. 
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Felix Klein (1849-1925) interpreted Beltrami's model in terms of projective ge- 
ometry in 1871. He modelled hyperbolic geometry as metric projective geometry in 
the interior of some conic. Beltrami's model was the special case where the conic 
was a circle. 

In 1882, Henri Poincare (1854-1912) showed that linear fractional transforma- 
tions with real coefficients preserve the complex upper half-plane. Poincare's model 
is this upper half-plane with arc length defined as ds = ^/dx^~+~dy^ / y, and geodesies 
as vertical lines or semicircles orthogonal to the real axis. Distance is then arc length 
along the geodesic joining two points. Poincare's model allows groups of linear frac- 
tional transformations to be represented as discrete groups of orientation-preserving 
isometries of the hyperbolic plane. The model can be mapped to the unit disc, and 
in later works Poincare extended his model to three dimensions, and discussed a 
two-sheeted hyperboloid model as well. 

Using these models, Poincare found many applications for hyperbolic geometry, 
which helped the new geometry to seem useful and familiarcj Indeed, Poincare's 
work showed that hyperbolic geometry was already part of mainstream mathemat- 
ics0 For example, he used hyperbolic geometry to prove the existence of linear 
fractional transformations. He then studied the group of such transformations, 
using the tesselation of the upper half-plane pictured here. 




-2-1012 



''^Scott Walter. The non-Euclidean style of Minkowskian relativity. In Jeremy Gray, editor, 
The Symbolic Universe: Geometry and Physics 1890-1930, pp. 91-127. Oxford University Press, 
1999, p. 92. 

^^John Stillwell. Introduction to Poincare's 'Theory of Fuchsian groups', 'Memoir on Kleinian 
groups', 'On the applications of non-Euclidean geometry to the theory of quadratic forms'. In John 
Stillwell, editor. Sources of Hyperbolic Geometry, pp. 113-122. American Mathematical Society, 
1996, p. 113 
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Other applications included complex analysis, differential equations and number 
theoryo 

The models and applications of hyperbolic geometry which were so important to 
its acceptance could not have developed without advances in many different areas of 
mathematics in the nineteenth century. These areas included differential geometry, 
group theory, geometric transformations and projective geometry. 

GauB's theory of curved surfaces (1827) was essential to the eventual recognition 
of hyperbolic geometry]^ The language established by the theory of curvature could 
be used to describe or analyse surfaces in terms of their intrinsic properties, that 
is, without consideration of the space in which they were embeddedcj Geodesies 
in surfaces could be defined by analogy with straight lines in the plane 1^ These 
techniques were employed in modelling hyperbolic geometry, as all the models were 
surfaces of constant negative curvature with geodesies defined intrinsically. 

Riemannian geometry was also crucial to the reception of hyperbolic geometry. 
At Gottingen in 1854, G. F. B. Riemann (1826-1866) gave a lecture entitled "Uber 
die Hypothesen, welche der Geometrie zu Grunde liegen" ("On the Hypotheses 
which lie at the Foundations of Geometry"). It was published in 1867. In this 
lecture, Riemann described a very general notion, that of an n-dimensional man- 
ifold. A manifold is characterised by a surface, and a metric chosen from among 
infinitely many possible alternatives. The theory of manifolds included Euclidean 
geometry as a special case, and hyperbolic geometry as an infinite family of cases 
where the surface has constant negative curvatureo By showing that infinitely 
many geometries were possible, and allowing the design of geometries very differ- 
ent from Euclid's, Riemann's work helped get hyperbolic geometry admitted to 
mathematical respectabilityl^ 

Many of the applications of hyperbolic geometry used group theory. The idea 
of a group began with concrete cases, such as permutations of the roots of algebraic 
equations, which were studied by J. H. Lagrange (1736-1813) and Paolo Ruffini 
(1765-1822) in the eighteenth century. In 1826, Niels Henrik Abel (1802-1829) 
used such groups to prove that it was impossible to obtain a solution in radicals of 
the general equation of degree greater than or equal to 5. Augustin Louis Cauchy 
(1789-1857) wrote many papers on the theory of groups of substitutions. The term 
'group' was first used in an article by Evariste Galois (1811-1832), who used it only 
for groups of substitutions. Cayley, in 1854, was the first to formulate the concept 
of an abstract group, by giving an axiomatic definition of a group. Klein brought 
group theory into geometry, by noticing that geometric transformations can form 
groups under composition. 

Although Euclid had rejected the use of motion in geometry, other mathemati- 
cians employed many different kinds of geometric transformations^ Archimedes 



■*^John Stillwell, 'Introduction', pp. 113-114. 

^^John Milnor. Hyperbolic Geometry: the first 150 years. In Bulletin of the American Mathe- 
matical Society, 6:9-24, 1982, p. 10. 
^^Torretti, Philosophy of Geometry, p. 78. 
^^Gray, Ideas of Space, pp. 136-137. 
'^^Torretti, Philosophy of Geometry, p. 40. 
^^Gray, Ideas of Space, pp. 141-145. 

^^Rosenfeld, History of Non-Euclidean Geometry, p. 112-151. 
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(b. 287? B.C.E.) used reflections, and Apollonius (2627-190? B.C.E.) used dilations 
and inversions. Greek and Arabian astronomers applied stereographic projection 
to construct astrolabes and maps. The geometric transformations employed by 
Isaac Newton (1642-1727) included central projection and projective transforma- 
tions. Alexis Claude Clairaut (1713-1765) defined general affine transformations 
in words in 1733, and Leonhard Euler (1707-1783) gave general formulae for affine 
and similarity transformations in 1748, and classified plane motions. Euler also 
used conformal mappings of the plane and linear fractional transformations, which 
define transformations generated by similarities and inversions in circles. A. F. 
Mobius (1790-1860) introduced the general concept of a geometric transformation 
as a one-to-one correspondence between figures. 

In his Erlangen Program of 1872, Klein described geometry as the study of those 
properties of figures which are preserved by a particular group of geometric trans- 
formations. Geometries with non-trivial symmetries, such as projective geometries 
and metric spaces with constant curvature, could now be characterised by classi- 
fying groups of transformations^ Two geometries were equivalent if their char- 
acteristic groups were isomorphic. For example, Euclidean geometry in the plane 
is the study of those properties of figures, such as angles and the ratios between 
lengths, which remain invariant under the group of similarities of the plane, which 
includes all translations, dilations and rotations. Klein's Erlangen Program was at 
first little noticed, but achieved international infiuence by the end of the nineteenth 
centuryF^ As a result of this program, the study of some geometries became the 
study of transformation groups0 Sophus Lie (1842-1899), for example, used this 
idea to characterise all three-dimensional geometries by determining all subgroups 
of motions in three-dimensional space. Hyperbolic geometry was now just one of 
many alternatives which could be investigated using algebraic techniques, and was 
a source of applications of group theory. 

Projective geometry, with its roots in antiquity, was largely developed in the 
nineteenth century. Depending on the projective metric chosen, the requirements 
of spherical, Euclidean or hyperbolic geometry can be met. So, as Klein discovered, 
projective metric spaces can model hyperbolic geometry!^ 

1.5.3 Philosophical disputes 

The philosophical controversy about non-Euclidean geometry was extensive. Since 
the Greeks, Euclidean geometry had been considered the correct representation 
of space, which accorded with intuition and possessed admirable logical structure. 
The discovery of non-Euclidean geometry led to debates on issues including the 
relationship between mathematics and the real world, and the source of geometric 
notions. Changes to the ideas held in these areas were necessary for non-Euclidean 
geometry to be accepted, while non- Euclidean geometry itself inspired investigations 
into the foundations of mathematics. 



^^John D. Norton. Geometries in collision: Einstein, Klein and Riemann. In Gray, editor, The 
Symbolic Universe, pp. 128-144, p. 129. 

^^Boyer, History of Mathematics, p. 549. 
Jeremy Gray. Geometry — formalisms and intuitions. In Gray, editor, The Symbolic Universe, 
pp. 58-83, p. 62. 

^^Torretti, Philosophy of Geometry, p. 110. 
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One of the basic issues raised by non-Euclidean geometry was the connection 
between geometry and physical space. In classical physics, Euclidean geometry was 
held to be the science of space, a view which implied that this geometry really 
existed!^ The ontological status of geometry was also linked to debates over the 
source of geometric knowledge. If our geometric notions are inborn, a priori, then 
geometry really exists. Further, our intuition must be obeyed, and so the counter- 
intuitive nature of many non-Euclidean results is a sound argument against them. 
If, on the other hand, geometry is an abstraction of the physical world, then Eu- 
clidean geometry is just one of many possible human constructs. It may well be the 
idealisation which is favoured by experience, but this does not imply that it really 
exists. 

Disputes about the source of mathematical knowledge went back to the Greeks 
In Plato's philosophy, mathematical knowledge was a priori. Geometric concepts 
were particular cases of the Forms, those eternal perfect objects which are the 
template for physical things, and to which we have access before we are born. To 
prove these ideas, Plato describes in Meno a conversation between Socrates and a 
slave-boy. After some leading questions, the boy proves a result about the square 
of the hypotenuse of a right-angled triangle. In the dialogue, Socrates concludes 
. . . now these opinions have been newly aroused in this boy as if in a 
dream, but if someone asks him these same things many times and in 
many ways, you can be sure that in the end he will come to have exact 
knowledge of these things as well as anyone else does ... he will come to 
have knowledge without having been taught by anyone, but only having 
been asked questions, having recovered this knowledge himself, from 
himself^ 

The opposing view, put by Aristotle, was that geometric concepts are abstractions 
of real world objects: a mathematician "effects an abstraction, for in thought it is 
possible to separate figures from motions" @ Whether they regarded it as inborn 
knowledge or an abstraction, the Greeks considered Euclidean geometry to correctly 
represent the physical world. 

The idea of space as an empty receptacle which is then occupied by matter was 
not familiar to Greek philosophers. The classical Greek words topos (place) and 
kenon (void) do not correspond to the word space, since place is the place of a body, 
which is determined by its relationship with other nearby bodies, while a void is 
always filled and is finite]^ The modern concept, though, was current in Europe 
by the fifteenth century. Natural philosophers such as Giordano Bruno (1548-1600) 
held that space was infinite and existed independent of matter. Bruno wrote 
Space is a continuous three-dimensional natural quantity, in which the 
magnitude of bodies is contained, which is prior by nature to all bodies 
and subsists without them but indifferently receives them all, and is free 
from the conditions of action and passion, unmixable, impenetrable, 

^^Torretti, Philosophy of Geometry, p. 25. 

^°Rosenfeld, History of Non-Euclidean Geometry, pp. 186-187. 

**ipiato. Meno. Aris & Phillips Publishers, 1985, p. 79. 

^^Quoted in Rosenfeld, History of Non-Euclidean Geometry, p. 186. 

^'^Torretti, Philosophy of Geometry, pp. 25-26. 
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unshapeable, non-lo cat able, outside all bodies yet encompassing and 

incomprehensibly containing them all@ 
Once this concept of space was part of philosophical discourse, the dispute about 
the source of geometry embraced the origin of our ideas of space as well as of 
more specific geometrical notions. Immanuel Kant (1724-1804) is important for his 
association with the doctrine that our concept of space is a priori. His major work 
Kritik der reinen Vernunft (Critique of Pure Reason) (1781) presented his theories 
of time and space. Here, Kant wrote that 

1. Space is not a conception which has been derived from outward ex- 
periences. For, in order that certain sensations may relate to something 
without me . . . the representation of space must already exist as a foun- 
dation. . . . 

2. Space then is a necessary representation a priori, which serves for 
the foundation of all external intuitions!^ 

As for geometry, this was discussed by Kant in the Kritik, and in a chapter on pure 
mathematics in the Prolegomena (1783), a work written as a survey of the main 
points in the Kritik. Kant argued that geometry was the true science of space, that 
is, of space the a priori representation, and that geometry was revealed by pure 
intuition in full agreement with Euclid's Elements]^ Because of these doctrines, 
followers of Kantian orthodoxy in the nineteenth century opposed non-Euclidean 
geometry, dismissing it as an intellectual exercise unrelated to the real world 

Much of classical physics depended on Euclidean geometry@ Johann Kepler 
(1571-1630) regarded geometry as the foundation of both celestial and terrestrial 
physics, writing 

We see that the motions [of the planets] occur in time and place and 
that the force [that binds them to the sun] emanates from its source 
and diffuses through the spaces of the world. All these are geometri- 
cal things. Must not that force be subject also to other geometrical 
thingsQ 

The coordinate geometry of Rene Descartes (1596-1650) was crucial to the status 
of geometry in physicsllj With the Cartesian coordinate system, space could be 
equated with a structured set of points and, importantly, motion could be analysed. 

Gottfried Wilhelm Leibniz (1646-1716) and Isaac Newton are considered the 
founders of mathematical physics. They had differing views of space. Leibniz (like 
Descartes) considered it impossible for space to be empty. In Nouveaux essais sur 
I'entendement humain (New Essays on Human Understanding), Leibniz wrote that 
"it is necessary rather to conceive space as full of a matter originally fluid, suscep- 
tible of all the divisions" Newton, however, thought that space was independent 
from matter. In the Principia (1687), he defined absolute space as: 



^''Quoted in ibid., p. 28. 

^^Quoted in Rosenfeld, History of Non-Euclidean Geometry, p. 187. 

^^Torretti, Philosophy of Geometry, p. 33. 

s^Ibid. 

s^Kline, Mathematical Thought, pp. 880-881. 
^^Quoted in Torrctti, Philosophy of Geometry, p. 23. 
™Torretti, Philosophy of Geometry, p. 34. 

''^Quoted in Rosenfeld, History of Non- Euclidean Geometry, p. 184. 
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Absolute space, in its own nature, without relation to anything external, 
remains always similar and immovable!^ 
Absolute space was essential to the results of the Principia, and it was Newton's 
ideas on the nature of space which prevailed as theoretical mechanics was developed 
in the eighteenth century. The implicit assumptions about space made by Newton 
and later mechanists included that it was continuous, infinite, three-dimensional 
and homogeneous, and that it satisfied Euclidean theorems!^ 

Because physics depended on Euclidean geometry, undermining the parallel ax- 
iom was a real physical problem^ There were sound reasons to think that physical 
space was Euclidean. How then could an alternative geometry exist? The word 
'existence' here was taken to mean true physical existence, and this is certainly the 
interpretation of 'existence' held by Bolyai and Lobachevsky. Apart from the paral- 
lel axiom, they never questioned Euclidean geometry, which Bolyai described as "the 
absolutely true science of space" @ Concrete notions of existence definitely ham- 
pered the acceptance of non-Euclidean geometry: contemporaries could discount 
Lobachevsky and Bolyai's work because neither author succeeded in establishing 
the physical existence of their geometryQ 

The development of Riemannian geometry was very important in changing 
mathematicians' ideas about the nature of mathematical existence. Riemann be- 
gan his 1854 lecture on the foundations of geometry by observing that all previous 
presentations of geometry had presupposed the nature of space. He then described 
space, meaning real physical space, as just one instance of a general concept which 
he called a mehrfach ausgedehnte Grosse (multiply extended quantity) Riemann 
showed that such an extended quantity (in modern terminology, a manifold) may 
have arbitrarily many "metric relations" ; therefore "space constitutes only a special 
case of a threefold extended quantity" |3 The implication of Riemann's geometry 
for the philosophy of space was that, since there can be only one geometry of real 
space among infinitely many mathematical possibilities, the science of space could 
not be determined by mathematical processes alone. As Riemann said, "those prop- 
erties which distinguish space from other conceivable threefold extended quantities 
can be gathered only from experience" @ Riemann acknowledged that Euclidean 
geometry was empirically successful within "the limits of observation" but his 
work meant that mathematical existence could no longer be equated with physical 
existence. This was a very significant change in mathematical discourse. 

Because each possible manifold had equal mathematical validity, Euclidean and 
non-Euclidean geometries could now exist side by side. Indeed, Beltrami's model 
showed that the consistency of Euclidean geometry implied the consistency of hy- 
perbolic geometry. The models of hyperbolic geometry, such as Beltrami's, also 



''^Quoted in ibid., p. 185. 

'''^Torretti, Philosophy of Geometry, p. 25. 

■^^Kline, Mathematical Thought, pp. 880-881. 

^^Quoted in Torretti, Philosophy of Geometry, p. 62. 

^^Gray, Ideas of Space, p. 170. 

''^Torretti, Philosophy of Geometry, p. 83. 

''^Quoted in ibid., p. 83. 

^^Quoted in ibid., p. 84. 

^"Quoted in ibid., p. 104. 
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represented a move towards semantic neutrality. With words such as 'straight' be- 
ing interpreted in ways differing from their everyday meaning, physical existence 
became irrelevant. Poincare, in the spirit of the Erlangen Program, characterised 
geometries using groups, and then explained that alternative geometries can ex- 
ist because the existence of one group is not incompatible with the existence of 
another jf^ 

The development of projective geometry further aided the acceptance of hyper- 
bolic geometry. Since projective geometry was so counter-intuitive, it was quickly 
characterised by axioms, a project which was completed in 1882 by Moritz Pasch 
(1843-1930)@ Objections to non-Euclidean geometry based on intuition had little 
weight once mathematics accepted systems as strange as projective geometry. 

By the late nineteenth century, mathematicians made no claims that their work 
represented the physical truth. For example, in his popularising work of 1912, La 
science et I'hypothese, Poincare wrote that the question of whether Euclidean ge- 
ometry was true had no meaning. He dismissed the argument that geometric ideas 
are a priori by pointing out that alternative systems, such as projective geometry, 
are conceivableo Also, he argued, geometry could not be a body of experimental 
truths, since if it were experimental it would not be exact, and would be subject 
to continuous revision^ Thus the choice of which geometry to use in representing 
physical space was a matter of convention, with experience guiding but not de- 
termining our choice of the most convenient systemcj Poincare concluded: "One 
geometry cannot be more true than another; it can only be more convenient" @ 

This attenuation of mathematical truth-claims meant that the mathematical 
notion of existence was very much weaker than that of physics. In fact, over the 
course of the nineteenth century, it became for the first time possible, then necessary, 
to distinguish between the subjects of mathematics and physics, as they developed 
quite distinct institutions and aimsjfll The leading physicist Ludwig Boltzmann 
(1844-1906), for instance, considered mathematics to be unrelated to the real world, 
and to have only a service role in physics@ The writings of Ernst Mach (1838- 
1916) were very influential in physics. His notion of existence was stronger than 
that prevailing in mathematics, and he wrote of nineteenth-century geometry: 

Analogues of the geometry we are familiar with, are constructed on 
broader and more general assumptions for any number of dimensions, 
with no pretension to being regarded as more than intellectual scientific 
experiments and with no idea of being applied to reality. . . . Seldom 
have thinkers become so steeped in reverie, or so far estranged from 
reality, as to imagine for our space a number of dimensions exceeding 
the three of the given space of sense, or to conceive of representing that 



Arthur Miller. Einstein, Poincare, and the testability of geometry. In Gray, editor, The 
Symbolic Universe, pp. 47-57, p. 54. 
^^Torretti, Philosophy of Geometry, p. 190. 

^•^Henri Poincare. Science and Hypothesis. Dover Publications Inc., 1952, p. 48. 
^Hhid., pp. 49-50. 
^^Ibid., p. 50. 
s^Ibid. 

Jeremy Gray. Introduction. In Gray, editor, The Symbolic Universe, pp. 1-21, p. 3. 
s^Ibid., p. 4. 
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space by any geometry that appreciably departs from the Euchdean. 

Gauss, Lobachevsky, Bolyai and Riemann were perfectly clear on this 

point, and certainly cannot be held responsible for the grotesque fictions 

subsequently stated in this field@ 
It is interesting, then, that in twentieth-century physics, space is Riemannian and 
not Euclideancj Einstein's theories of special and general relativity depend on ge- 
ometry quite different from the three-dimensional Euclidean geometry of classical 
physicscj Indeed, from 1908 to 1916, the laws of special relativity were reformu- 
lated and reinterpreted in terms of hyperbolic geometry]^ Even so, non-relativistic 
physics still employs only one geometry to represent physical space, while mathe- 
maticians consider a whole variety of geometries. 

Non- Euclidean geometry stimulated discussion of the foundations of mathemat- 
ics. Works responding to the new geometries began to appear in the 1860s. Rigorous 
axiomatisation was their aim, and they embodied the new, weaker notion of exis- 
tence, and a lesser role for intuition. Lie, for example, published two papers on the 
foundations of geometry in 1890. In these, he asked what properties were necessary 
and sufficient to define Euclidean geometry and the non-Euclidean geometries of 
constant non-zero curvature. Thus Lie considered the problem of space as an issue 
of pure mathematics only, showing the divide between mathematics and ideas about 
physical space @ 

The investigations into foundations culminated in 1899 with the publication of 
Grundlagen der Geometrie by David Hilbert (1862-1943). This was a complete 
axiomatisation of geometry, which rectified the logical inadequacies of Euclid's El- 
ements. For Hilbert, a geometry was defined by a set of unproved axioms, any of 
which might be negated to obtain a different theory. The axioms contained unde- 
fined primitive terms, which could be modelled as referring to particular objects 
such as points, lines and planes, restricted only by the implications of the axioms. 
Such a level of formalism further weakened claims as to the truth of geometry@ At 
the end of the nineteenth century, Euclidean and hyperbolic geometry had become 
two abstract axiomatic systems, differing in only one axiom, and equally consistent. 
This philosophy was very different from that of Bolyai and Lobachevsky, but their 
work contributed to ito 

1.6 Conclusion 

This history shows how the development and reception of non-Euclidean geome- 
try were associated with fundamental changes in the way mathematics is done and 
thought about. The recognition and study of abstract mathematical spaces, and 
the realisation of the connection between geometry and groups, are stages in this 
history which are of particular relevance for subsequent chapters. Indeed, the style 



^^Quoted in Rosenfeld, History of Non-Euclidean Geometry, p. 203. 

^°Gray, 'Geometry', p. 80. 

^^Miller, 'Einstein, Poincare', p. 55. 

^^Walter, 'Non-Euclidean style', p. 91. 

^•^Torretti, Philosophy of Geometry, p. 154. 

^'^Gray, 'Geometry', p. 69. 
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^Torretti, Philosophy of Geometry, p. 61. 
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of exposition adopted for the rest of this thesis, with exphcit definitions, acknowl- 
edgement of assumptions and so forth, illustrates well the abstraction and logical 
precision of modern mathematics. The shift in conceptions of geometry which re- 
sulted in this modern approach was stimulated by the discovery of non-Euclidean 
geometry, and also aided the acceptance of the new geometry by mathematicians. 
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Chapter 2 

Hyperbolic Geometry 



2.1 Introduction 

In this chapter we survey hyperbohc geometry. We provide a modern treatment of 
much of the geometry discussed in Chapter 1, and lay the foundations for Chapters 
3 and 4. After a section of prehminary definitions and results, we consider four 
models of hyperbolic geometry. Hyperbolic geometry is actually a continuum of ge- 
ometries, each distinguished by its Gaussian curvature. For convenience, the models 
of hyperbolic geometry presented here all have constant curvature of —1. The first 
model discussed is called the hyperboloid model, because the hyperbolic plane in 
this model is one sheet of a two-sheeted hyperboloid. In general, the hyperboloid 
model is the sphere of radius i in a non-Euclidean space called Lorentzian n-space. 
Lorentzian 4-space is the model of space-time in special relativity. Next is a brief 
treatment of Beltrami's model, which was reinterpreted by Klein, and is also known 
as the projective disc model. The model introduced by Poincare was the upper half- 
plane, and he showed how this model could be mapped to the unit disc. Poincare's 
models have now been generalised to higher dimensions, and are known respectively 
as the upper half-space model, and the conformal ball model. We will prove that, in 
the upper half-space and the conformal ball, hyperbolic isometrics may be identified 
with compositions of finitely many reflections in planes and spheres. In the upper 
half-plane, we go on to consider how tesselation by triangles is related to the action 
of the discrete group SL{2, Z). Most of the material in this chapter is selected and 
adapted from the treatment of hyperbolic geometry in Chapters 3-6 of [21]. The 
proofs of relevant exercises in |21j are provided. Some additional results which will 
be needed in later chapters are estabhshed. 

2.2 Geometric preliminaries 

This section comprises useful definitions and results relating to Euclidean n-space 
and to general metric spaces. The geometric transformations discussed include 
reflections in planes and spheres. Compositions of finitely many such reflections 
are called Mobius transformations, and Mobius transformations turn out to be very 
important for understanding hyperbolic geometry. 

2.2.1 Euclidean n-space 

We define Euclidean n-space, to be the metric space consisting of M" together 
with the distance function 

c?£;(x,y) = |x - y|. 
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Here, the right-hand side is the Euchdean norm, |x| — (x • x)^/^. The Euchdean 
inner product is the usual dot product given by 

X • y = XiVi + X2y2 H \- XnVn- 

We are referring to as a distance function rather than as a metric in order 
to avoid confusion with Riemannian metrics in Chapter 3. The Cauchy-Schwartz 
inequahty for x, y e 

|x-y| < |x||y| 

with equahty if and only if x and y are linearly dependent, allows us to define 
^(x, y), the Euclidean angle between non-zero vectors x and y, by 

x-y = |x||y| cos6'(x,y). 

2.2.2 Geometric transformations and group actions 

An isomctry from a metric space {X, dx) to a metric space (F, dy) is a bijection 
(j) : X such that 

dx{xi,X2) = c?y(0(a;i),0(a;2)) 

for all xi,X2 G X. The set of isometrics from a metric space X to itself, denoted by 
/(X), forms a group under composition. Isometries of E'^ are known as Euclidean 
isometries. 

A function : M" — > M" is called an orthogonal transformation if 

0(x) ■ 0(y) = X • y 

for all X, y G M". Let {ei, e2, . . . , e„} be the standard basis of M". Then, it 
can be proved that a map : is an orthogonal transformation if and 

only if (p is linear and {0(ei), ^(62), . . . , 0(e„)} is an orthonormal basis of M". It 
follows that every orthogonal transformation is a Euclidean isometry, and that 
orthogonal transformations may be identified with orthogonal n x n matrices. The 
set of all orthogonal n x n matrices, denoted by 0{n), forms a group under matrix 
multiplication. 

A group G is said to act on a set X if there exists a function from G x X to X, 
written (^f, x) ^ gx, such that for all g,h & G, and all x & X, we have 

Ix — X and g{hx) — {gh)x. 

For example, the group I{X) acts on X, and the group 0{n) acts on the set of 
m-dimcnsional vector subspaces of R"', for 1 < m < n. The action of G on X is 
said to be transitive if for each x,y E X there is a, g E G such that gx = y. 
Proposition 2.2.1. For each dimension m, the action of the group 0{n) on the 

set of m- dimensional vector subspaces o/M" is transitive. 

Proof. Let V be any m-dimensional subspace of W\ Identify M™ with the span of 
the set {ei, 62, ... , e^} in M". Since 0{n) is a group, it suffices to show there is an 
A e 0{n) such that = V. 
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Choose an orthonormal basis {wi, W2, . . . , w^} for V. Then, using the Gram- 
Schmidt process, extend it to an orthonormal basis {wi, W2, . . . , w„} for M". Let A 
be the nxn matrix which has wi, W2, . . . , w„ as its columns. Then A is orthogonal, 
and A{W) = V. □ 

An m-plane of E"^ is defined to be a coset a + V, where a G -E" and V is an 
m-dimensional subspace of 

Corollary 2.2.2. For each dimension m, the group I{E'^) acts transitively on the 
set of m-planes of E"". 

Proof. Let sl + V and b + 14^ be m-planes of E^. By Proposition 12.2.11 there is an 
A e 0{n) such that A{V) = W. Define the map : ^ by 

0(x) = (b - Aa) + Ax. 

Then 0(a + 1/) = b + W, and 

|0(x) -0(y)| = \Ax-Ay\ = |x-y|, 

since A is an isometry. Hence is an isometry. □ 

The following proposition shows that every Euclidean isometry is the composi- 
tion of an orthogonal transformation and a translation. 

Proposition 2.2.3. The function cf) : i?" i?" is an isometry if and only if cf) is 
of the form 

0(x) = a + Ax, 
where A is an orthogonal matrix and a = 0(0). 

Proof. Suppose is an isometry. Define a map A by A(x) = 0(x) — 0(0). Then 
A(0) = 0, and, since preserves Euclidean norms, 

|A(x)-A(y)| = |0(x)-0(y)| = |x-y| 
for all X, y G -E". Therefore, if x G -E", 

|A(x)| = |A(x)-A(0)| = |x-0| = |x|, 

so A preserves Euclidean norms. It follows that A is orthogonal, since 

2(Ax ■ Ay) = |Ax|2 + |Ay|2 - |Ax - Ayj^ 
= |x|2 + |y|2- |x-y|2 
= 2(x-y). 

So we have an orthogonal matrix A such that 0(x) = 0(0) + Ax. 

Conversely, suppose is of the form 0(x) = 0(O)+y4x. Then is the composition 
of an orthogonal transformation and a translation, both of which are isometrics. 
Therefore is an isometry. □ 
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A function cj) : X ^ Y between metric spaces {X,dx) and {Y^dy) is called a 
similarity if it is a bijection, and there is a scale factor > such that 

dx{xi,X2) = kdY{(i){xi),(f){x2)) 

for all xi,X2 G X. The set of similarities from a metric space X to itself, denoted 
by S{X), forms a group under composition. The isometries of X are a subgroup of 
S{X). Under transformation by elements of all the theorems of Euclidean 

geometry remain true. Thus the similarities of are its characteristic group, in 
the sense used by Klein in his Erlangen Program. The following characterisation of 
similarities follows from Proposition 12.2.31 

Proposition 2.2.4. The function (p : E"- —>■ E"- is a similarity if and only if (j) is 
of the form 

0(x) = a + fcy4x, 

where A is an orthogonal matrix, k is a positive constant and a = 0(0). 

2.2.3 Geodesies 

Let (X, d) be a metric space, and let a : [a, 6] ^ X be a continuous injection, where 
a < 6 in M. We say a is a geodesic curve if, for all s, t G [a, 6], 

a{t)) = \s — t\. 

For points x and y in X, we denote by [x,y] the image in X of a geodesic curve 
a : [a,b] X such that a{a) = x and a{b) = y. We call [x,y] a geodesic segment. 
Then [x, y] U [y, z] is a geodesic segment joining x to z if and only if 

d{x,y) +d{y,z) = d{x,z). 

A geodesic line is defined to be a function A : M — > X which is locally distance- 
preserving. This means that for any x G A(M), with x = A(t), there is a set [a, 6] C M 
containing t such that A restricted to [a, b] is a geodesic curve. A geodesic is the 
image in X of a geodesic line A : M ^ X. 

It turns out that, if the metric space (X, d) satisfies the axioms of Euclidean or 
hyperbolic geometry, then the shortest path from x to y in X is along the unique 
geodesic segment joining x and y, and the length of this shortest path is d{x,y). 

2.2.4 Reflections 

Let u be a unit vector in E"", and let t be a real number. The hyperplane of E"- 
with unit normal vector u passing through the point tu is the set 

P(u,t) = {xG ^" : u-x = t}. 

We will sometimes refer to hyperplanes as just planes. The reflection p of E"- in the 
plane P{u,t) is given by 

p(x) = X + 2{t — u ■ x)u. 
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Then p(x) = x if and only if x G P{u,t), and p^(x) = x for all x G E"'. We now 
show that every Euclidean isometry is a composition of finitely many reflections in 
hyperplanes. 

Lemma 2.2.5. Let x, y be in E"' and p be the reflection in the plane P{u,t). Then 

IpW -p(y)| = |x-y|. 
That is, p is a Euclidean isometry. 

Proposition 2.2.6. Every isometry of E"' is a composition of at most n + 1 reflec- 
tions in hyperplanes. 

Proof. Let : i?" ^ E"^ be an isometry. We construct reflections Pq, pi, . . . , pn such 
that = poPi ■■■ Pn- 

Set vq = 0(0). If vq = let po be the identity, and otherwise let po be the 
reflection in the plane P (vo/|vo|, |vo|/2). Then 

Po(vo) = Vo + 2 — . — — - — - = 0, 

V 2 |vo| J |vo| 

and so po0(O) = 0. Let 0o = Po4>- Then the composition 0o is an isometry, and 
0o(x) = Po0(O) + po0(x), so by Proposition 12.2.31 0o is orthogonal. 

For k = 1,2, . . . ,n, assume inductively that (pk-i is an orthogonal transformation 
of E'"' which fixes and the basis vectors ei, e2, . . . , e^.i. Let be the vector 
4>k-i{^k) — ^k- If Vfc = let pk be the identity, and otherwise let pk be the reflection 
in the plane P (vfc/|vfc|, 0). Then, by using the assumption that (pk-i is orthogonal 
to expand out pk4'k-i{^k) , we find that Pk<Pk~i fixes e^. Also, for 1 < j < A; — 1, 

Vfc ■ = (0fc-i(efc) - Gfe) ■ 
= 0fe_i(efc) ■ ej 
= 0fe-i(efc) ■ (pk-ii^j) 

e^j • Gj 
= 0. 

Therefore, for 1 < j < — 1, the vector ej is in the plane P (vfc/|vjt|, 0), hence 
is fixed by the reflection p^. Consequently, (pk = Pk(pk-i fixes the basis vectors 
ei, 62, ... , Bfc. Also, since is in the plane P (vfc/|vfc|, 0), pk fixes 0. By induction, 
then, the maps po,pi, . . . ,p„ are each either the identity or a reflection, and the 
composition p„ ■ ■ ■ pipo0 fixes 0, ei, e2, . . . , e„. Using Proposition 12.2.31 again, we 
infer that Pn ■ ■ ■ PiPo0 is an orthogonal, hence linear, map which fixes all the basis 
vectors, and so is the identity. Since each reflection is its own inverse, this means 
that poPi ■■■pn = 4>- □ 

We now discuss reflections in spheres. These maps are also known as inversions. 
For a G E^ and r > 0, we denote by ^(a, r) the sphere with centre a and radius r. 
The reflection a of the set E"^ \ {a} in the sphere S{sl, r) is given by 

/ \ 9 X a 

a(x) = SL + r^- -. 

|x — a|2 
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Then o"(x) = x if and only if x G S{a,r), and (T^(x) = x for all x 7^ a. Abusing 
notation, we will say that a is the reflection of E"- in S{a, r). Although a is not an 
isometry, we do have the following identity 

Lemma 2.2.7. Let a be the reflection of in the sphere S{a, r) . For all x, y 7^ a, 

|(7(x) -(7(y)| = 

We will sometimes denote by S'^"^ the sphere S{0, 1) in i?". 

Reflections in planes and spheres are continuous maps with respect to the usual 
metric topology of M", which is that induced by the distance function (Ie- They 
are also differentiable functions. The following result shows that reflections in hy- 
perplanes and spheres preserve angles. Let Q be an open subset of and let 
: Q — >■ E^ be a differentiable function. We say is conformal if (f) preserves angles 
between differentiable curves in Vt. An important class of conformal maps is the set 
of orthogonal matrices. These are conformal because they preserve the Euclidean 
inner product. 

Proposition 2.2.8. Every reflection of E"' in a hyperplane or sphere is conformal. 

Proof. Let p be the reflection of E"- in the plane P{u,t). As p(x) = x + 2(t — u-x)u, 
the j*'^ component of p(x) is xj + 2{t — {uiXi + U2X2 + • • • + UnXn))uj. Thus 

Dp{x) = {Sij - 2uiUj) = 7 - 2C/, 

where Dpipi) is the matrix of partial derivatives of p(x), and U is the matrix {uiUj). 
As -Dp(x) is independent of t, we may assume without loss of generahty that t — Q. 
Then p(x) = (/ - 2C/)x, and 

p(x) • p(y) = (x - 2(u ■ x)u) ■ (y - 2(u ■ y)u) 

= X ■ y - 4(u ■ x) (u • y) + 4(u • x) (u ■ y) (u • u) 
= xy, 

so p is an orthogonal transformation. Therefore / — 2U is an orthogonal matrix, 
and so the reflection p is conformal. 

For reflections in spheres, we consider first the reflection in the sphere S'(0, r). 

We have (Tr(x) = r^x/|xp, so the j^^ component of (Tr(x) is r'^Xj/ {xl + x1-\ 

Hence, 
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where U is the matrix {xiXj/\x.\'^). From above, / — 2U is orthogonal, as x/|x| is a 
unit vector. Thus cr^ is conformal. 

Now let (7 be the reflection in the sphere S'(a, r) and let r be translation by a. 
Then a — rarT'^. Since r and (T^ are conformal, this means a is conformal. □ 

2.2.5 Mobius transformations 

We now consider the one-point compactification of E^\ E^^D {cxo}, which we denote 
by E'^. The reflections p and a above may be extended to be maps from E'^ to 
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E"', by setting p{oo) = oo, (j(a) = oo and a{oo) = a. A sphere of is defined 
to be either a Euchdean sphere S{a,r), or an extended plane P{u,t) U {oo}. For 
simphcity of notation, we will also use P(u, t) to denote extended planes. In the 
case = 2, we may identify E"^ with the Riemann sphere C. 

A Mobius transformation of E"- is a finite composition of refiections in spheres of 
E^. The set of all Mobius transformations of E"', denoted by M{E"'), forms a group 
under composition. By Proposition 12.2.61 any Euclidean isometry may be written as 
a finite composition of refiections, so we may by extending these refiections regard 
the group of Euclidean isometrics to be a subgroup of M{E'^). From the properties 
of refiections, Mobius transformations are continuous, differentiable and conformal. 

We now discuss the action of M{E"') on spheres. A Mobius transformation of 
particular importance is the refiection of E^ in the sphere S'(e„, a/2). Identify E'^~^ 
with X {0} in E"^. The stereographic projection vr of E"^'^ onto the sphere 

S*""^ is defined to be the projection of x G E"'^^ towards (or away from) the vector 
e„ until it meets the sphere 5*""^. Let a be the refiection of E^ in the sphere 
S{en, "\/2)- Then, by comparing the explicit formulae for vr and a, it can be shown 
that the restriction of a to is stereographic projection vr : E^~^ S^~^. 

Lemma 2.2.9. Let a he the reflection of E^ in the sphere S{a,r), and let oi he the 
reflection of E^ in the sphere S{0, 1). Define a map r : E"' — >• E"' hy r(x) = a + rx. 
Then a = rair^^. 

Proposition 2.2.10. The group M{E^) acts transitively on the set of spheres of 

Proof. We first need to show that M{E^) maps spheres to spheres. Let be a 
Mobius transformation and let E be a sphere of E"". As is a composition of 
refiections, we may assume that is a refiection. Suppose first that is a refiection 
in an extended plane. Since is an isometry, by Corollary l2.2.2[ maps hyperplanes 
to hyperplanes, and by the definition of a sphere maps spheres to spheres. Now 
suppose that is the refiection in the Euclidean sphere S'(a, r), and let r be the 
same map as in Lemma 12.2.91 By Proposition I2.2.4[ r is a Euclidean similarity, and 
so r maps spheres to spheres. We may thus, by Lemma 12.2.91 assume without loss 
of generality that is the refiection in the sphere S'(0, 1), that is, 0(x) = x/|xp. 

We may characterise the spheres of E'^ by a vector (oq, ai, . . . , a„+i) in ]R"+^, 
called a coefficient vector, which is unique up to multiplication by a non-zero scalar. 
The equations for the sphere S'(a, r) and the extended plane P(a, t) in E"- are 
respectively 

|xp -2a-x+ |a|^ -r^ = and - 2a ■ x + 2t = 0. 

These may be written in the common form 

ao|xp — 2a ■ X + a„+i = 0, (2.1) 

with |ap > a^an+i. Conversely, any vector (oq, ai, . . . , a^+i) G M""*"^ such that 
|ap > aoa„+i determines a sphere of i?" satisfying equation (12.11) . 
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Now let (ao, cii, . . . , fln+i) be a coefficient vector for S. Then x G S satisfies 
equation fl2.ip . and so if y = 0(x), by the formula for 0, the vector y satisfies 

ao - 2a - y + a„+i|yp = 0. 

This is the equation of another sphere, say S'. Hence maps S into S'. Since is 
its own inverse, the same argument shows that maps S' into S. Thus, since is 
a bijection, 0(S) = S'. We have shown that M{E"') acts on the set of spheres of 

To show that this action is transitive, let S be a sphere of E^. It suffices to prove 
there is a Mobius transformation such that 0(S) = E"'~^. By Corollary \2.2.2\ 
the group of Euclidean isometries acts transitively on the set of hyperplanes of E"". 
Since Euclidean isometries are Mobius transformations, this completes the proof 
if E is a hyperplane. We may thus assume that S is a Euclidean sphere. Now, 
the group of Euclidean similarities acts transitively on the set of spheres of E^. 
Let be the similarity which maps S to 5""^. Then by Proposition I2.2.4t has 
the form ip{x) = ip{0) + kAx, for some orthogonal matrix A and some k > 0. 
The map x i-^ A;x is the composition of the reflection in 5(0, 1) followed by the 
reflection in ^(O, Vk), and so is a Mobius transformation. Since A is orthogonal, 
X I—*- Ax is an isometry, and so is also a Mobius transformation. Hence ■0 is the 
composition of an isometry (translation) and a Mobius transformation. Therefore 
-0 is a Mobius transformation as well. We may now assume that E = S"'~^. Let a 
be the reflection in the sphere S'(e„, a/2). Then a is stereographic projection, so we 
have (j(5'"^^) = and the proof is complete. □ 

The proof of the following important lemma is omitted for reasons of brevity. 
See [21] • 

Lemma 2.2.11. Let (j) he a Mobius transformation which fixes every point of a 
sphere S of E^ . Then is either the identity or the reflection in S. 

2.3 The hyperboloid model 

We first establish useful definitions and results in Lorentzian n-space. Then, we 
define the hyperboloid model and show that it is a metric space. A discussion of 
its geodesies and trigonometry completes this section. 

2.3.1 Lorentzian n- space 

Let X and y be vectors in M", where n > 1, and consider the symmetric bilinear 
form 

X O y = -Xiyi + X2y2 H h Xnyn- 

This form is known as the Lorentzian inner product, although it is not positive def- 
inite. The space of together with o is called Lorentzian n-space. The Lorentzian 
norm derived from the Lorentzian inner product is ||x|| = (xox)^/^. Then, ||x|| may 
be positive, zero, or positive imaginary. If ||x|| is imaginary, we denote its modulus 

by III X III . 

Using the Lorentzian norm, the vectors in M" may be partitioned into space-, 
light- and time-like sets (these labels derive from the use of Lorentzian 4-space as a 
model for special relativity). Space-like vectors are defined to be those with positive 
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Lorentzian norm, and light-like vectors those with Lorentzian norm zero. A vector 
X is time-like if ||x|| is imaginary, and is positive or negative time-like as > or 
Xi < 0. In R^, the light-hke vectors are the cone xf = xl + a;|, the space-hke vectors 
are the exterior of this cone, and the time-like vectors are the two disjoint convex 
sets forming the cone's interior. 

We now concentrate on time-like vectors and prove an analogue of the Cauchy- 
Schwartz inequality. Two vectors x and y are said to be Lorentz orthogonal if 
xoy = 0. If vectors x and y with non-zero Lorentzian norm are Lorentz orthogonal 
then they are linearly independent. 

Proposition 2.3.1. Let x and y be non-zero Lorentz orthogonal vectors in R". If 
X is time-like then y is space-like. 

Proof. Since x is time-like, xl > xl-\ h x^. Therefore, 



i> E-n-r^- 



,1=2 



As X o y = 0, we have Xiyi — ^2^2 H 1- XnUn- Then, using the Cauchy-Schwartz 

inequality for the Euclidean inner product. 



J=2 
n 



i=2 



J=2 



J=2 



1=2 




2 \ -2 

Xi X^ 



i=2 



> 0. 



From the second last line, if Hyp = 
2<i <n. Then 7/1 = (Er=2^i^0^r^ 
||y|| > 0, that is, y is space-like. 



then Yll=2y'i ~ implying = for 
= 0, and so y = 0. But y is non-zero. Thus, 

□ 



An n X n matrix A such that Ax o Ay = x o y for all vectors x and y in is 
said to be Lorentzian. The set of all Lorentzian matrices, denoted by 0(1, n — 1), 
forms a group under matrix multiplication. The subset of matrices A e 0(1, n — 1) 
which transform positive time-like vectors into positive time-like vectors is in fact 
a subgroup of index two of 0(1, n — 1), and is denoted by PO(l,n — 1). A vector 
subspace of R" is said to be time-like if it contains a time-like vector. 
Proposition 2.3.2. For each dimension m, the action of PO{l,n — 1) on the set 
of m- dimensional time-like vector subspaces o/R"- is transitive. 

Proof. The method is a modification of the Gram-Schmidt process. Let V be an 
m-dimensional time-like vector subspace of R". Choose a basis {ui,U2, . . . , u^} 
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for V so that Ui is time-like. Then let wi = Ui/|||ui|||, hence wi o wi = —1. Let 
V2 = Ufc + (ufc o wi)wi, and W2 = V2/||v2||. For 3 < k < n, let the vector be 

k-l 

Vfc = Ufc + (Ufc O Wi)wi - ^(Ufc O Wj)Wj, 

1=2 

and let w^. = v/j/||vfc||. Then for 2 < k < n, we have o wi = 0, and so by 
Proposition 12.3.11 the vector v^, is space-like. Let A be the matrix with columns 
Wi, W2, . . . ,w„, so that A{W^) = V. The following calculation shows that A pre- 
serves the Lorentzian inner product and is thus an element of PO{l,n — 1): 

Ax o Ay = A{xiei H h x„e„) o A{yiei H h y„e„) 

= {xiAei H h x„v4e„) o (yiAei H h ?/n^e„) 

= (XiWi + X2W2 H h X„W„) O (yiWi + I/2W2 H h Vn^n) 

= -XiVi + X2I/2 H h x„y„ 

= xoy. 

□ 

Proposition 2.3.3. Suppose ^and y are positive time-like vectors in M". Then 
X o y < ||x||||y||, with equality if and only if ^ and y are linearly dependent. 

Proof. By Proposition I2.3.2[ there is a Lorentzian matrix A so that Ax = t&i for 
some t > 0. We may thus replace x and y by Ax and Ay without affecting the inner 
product or norms. So we may assume, without loss of generality, that x = xiei. 
Then, 

22 2/ 2 I I 2\ 

< xlyl 
= (xoy)^ 

with equality if and only if ?/| + ■ ■ ■ + = 0, that is, y = yi^i. Now, ||x|| and ||y|| 
are both positive imaginary, so ||x||||y|| < 0. Also, 

X o y = -xiyi < 0. 

Soxoy<||x||||y||, with equality if and only if x and y are linearly dependent. □ 

The following corollary uses this inequality to define the Lorentzian time-like 
angle between positive time-like vectors x and y. The Lorentzian angle is denoted 
?7(x, y), and will be used to define a distance function for the hyperboloid model of 
hyperbolic n-space. 
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Corollary 2.3.4. If x. and y are positive time-like vectors in W^, then there is a 
unique non-negative real number ri{'x,y) such that 

xoy = ||x||||y|| cosh77(x,y). 

Moreover, ?7(x, y) = if and only if x and y are positive scalar multiples of each 
other. 

To conclude our discussion of Lorentzian n-space, we define a Lorentzian cross 
product between vectors in R^. Let x and y be in R^, and let J be the matrix 



The Lorentzian cross product of x and y is defined to be x® y = J(x x y), where x 
is the usual cross product in R^. It can be calculated that x (g) y = Jy x Jx, and 
also that x o y = x • Jy. These identities can then be used to prove the following 
facts about the Lorentzian cross product. 
Lemma 2.3.5. // w, x, y and z are vectors in M.^, then 

1. X o (x (g) y) = y o (x (g) y) = 0, 

2. X (g) y = -y (g) X, 

(Xi X2 
yi z/2 
Zl Z2 

4. X o (y (g) z) = (x (g) y) o z, 

5. X (g) (y (g) z) = (x o y)z — (z o x)y, 

^/ X/ X ,/xowxoz 

0. (x (g) y) o (z (g) w) = det 

^ / V / \^y o w y o 

The following corollaries define the Lorentzian angle ^^(x, y) between pairs of 
space-like vectors x and y, and provide some useful identities involving Lorentzian 
angles and trigonometric or hyperbolic trigonometric functions. 

Corollary 2.3.6. Let x and y be linearly independent positive time-like vectors in 
R^. r/ienxigjy is space-like, anc? ||x (g) y|| = — ||x|| ||y|| sinh?7(x, y). 

Proof. We have 

||x (g) y||^ = (x (g) y) o (x (g) y) 

= (xoy)^ - ||x||^||y||^ 

= ||x|H|y|pcosh2 77(x,y) - ||x|H|y||2 

= ||x||^||y||^sinh^r7(x,y). 

Since ?7(x, y) > 0, and ||x|| and ||y|| are positive imaginary, on taking square roots 
of both sides we get 

||x(g)y|| = -||x||||y|| sinh77(x, y). 
The right-hand side is real and positive, so x (g) y is space-like. □ 
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Corollary 2.3.7. Let x and y be space-like vectors inM.^. //x(S)y zs time-like then 
|xoy| < ||x||||y||, and so there is a unique real number ri{'x.,y) G (0, vr) such that 

xo y = ||x||||y|| cos?7(x,y). 

Proof. Follows immediately from the equality ||xcg)y||^ = (xoy)^ — ||x|p||y|p. □ 

Corollary 2.3.8. Let x and y be space-like vectors in M.^ . // x (g) y zs time-like, 
then 

|||x(8)y||| = ||x|| ||y II sinr7(x, y). 

Proof. By Corollary 12. 3. 7[ since x(S)y is time-like, xoy = ||x|| ||y|| cos77(x, y). Then 
by Lemma [2.3.5[ 

II Il2 / \2 II Il2ll Il2 

||x(g>y|| = (x o yj — ||x|| ||y|| 

II ii2ii ii2 I \ II ii2ii ii2 

= ||x|| ||y|| cos^r7(x,y) - ||x|| ||y|| 
= -||x||^||yf sin^?7(x,y), 

and the conclusion follows from x (g) y being time-like. □ 
2.3.2 The hyperboloid model 

We define if", the hyperboloid model of hyperbolic n-space, to be the set of positive 
time-like vectors of norm i in Lorentzian {n + l)-space: 

if" = {x G M"+^ : ||xf = -1 and Xi > 0}. 

We are disregarding the negative time-like vectors of norm i so as to obtain a 
connected set. The hyperbolic plane is thus one sheet of the two-sheeted hy- 
perboloid = ^2 -l- Xg + 1. Let X and y be vectors in and let ri{'x, y) be the 
Lorentzian time-like angle between them. The hyperbolic distance between x and 
y is defined to be 

c?H(x,y) = r/(x,y). 

Since xoy = ||x|| ||y|| cosh?7(x, y), and the norms of x and y are equal to i, we have 
the explicit formula 

dH(x,y) = cosh~^(-xoy). 
Proposition 2.3.9. The function dfj is a distance function. 

Proof. Since x o y = y o x, (i// is symmetric. By the definition of ri{'x, y) in Corol- 
lary [23]U dn is non-negative. If x = y, then cosh~^(— x o y) = cosh~^(l) = 0, so 
c^i?(x, y) = 0. Conversely, if (i//(x, y) = 0, then by the definition of r7(x, y), the 
vectors x and y must be positive scalar multiples of each other. But x and y have 
the same norm, which implies x = y. 

For the triangle inequality, we need to show that for all x, y and z in H^, 

dni^, z) < dnijs., y) + dnij, z). 

By Proposition 12.3.21 there exists a Lorentzian matrix A such that Ax, Ay and Az 
are in the subspace of M"+^ spanned by ei, e2 and e^. So we may, without loss of 
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generality, assume that n = 2. This allows us to use results about the Lorentzian 
cross product. By Corollary 12.3.61 x (g) y and y ® z are space-like, and 

||x C?) y II = sinh?7(x, y), ||y Cg) z|| = sinh //(y, z). 

We have, also, 

(x (g) y) (g) (y (g) z) = ((x (g y) o y)z - (z o (x (g y))y = -(z o (x (g y))y, 

so the vectors y and (x(gy) <g (y (gz) are linearly dependent. Thus, (x(gy) (g (y (gz) 
is either zero or time-like, so we have by Corollary 12.3.71 that 

|(x (g y) o (y (g z)| < ||x (g y|| ||y (g z||. 

Combining all these facts, we obtain 

cosh(?7(x, y) + ?7(y, z)) = cosh //(x, y) cosh rjiy, z) + sinh ri{-x, y) sinh rjiy, z) 

= (-xoy)(-y oz) + ||x (g y|| ||y (g z|| 
> (x o y)(y o z) + (x (g y) o (y (g z) 
= (xoy)(y oz) + ((xoz)(y oy) - (xoy)(y oz)) 
= —X o z 
= cosh ?7(x, z). 

Since cosh is monotonic increasing, we infer that ^^(x, z) < //(x, y) + ?7(y,z), as 
required. □ 

The topology induced on if" by the distance function (Ih is the same as the 
subspace topology of H", where H"' is regarded as a subspace of IR"^^ with its 
usual metric topology. 

Hyperbolic geodesies 

We now define hyperbolic lines, and show that the hyperbolic lines of are its 
geodesies. A hyperbolic line of is the non-empty intersection of H"^ with a 
two-dimensional vector subspace of ]R"+^. If x and y are distinct vectors in 
then L(x, y) = if" fl span{x, y} is the unique hyperbolic line containing x and y. 
Observe that L(x, y) is one branch of a hyperbola. We first establish a sufficient 
condition for points of if" to lie on the same hyperbolic line. 
Lemma 2.3.10. i/x, y and z are points of H"' and 

^(x,y) +r?(y,z) = r/(x,y), 

then X, y and z are hyperbolically collinear, that is, there is a hyperbolic line of if" 
containing x, y and z. 

Proof. Since spanjx, y, z} has dimension at most 3, we may assume that n = 2 and 
use Lorentzian cross product results. From the proof that dn is a distance function, 
ri{x, y) + r]{y, z) = ?7(x, y) if and only if 

(x ® y) o (y ® z) = ||x ® y|| ||y ® z||. (2.2) 
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Also from that proof, the vector (x(8)y) (g) (y (8)z) is either zero or time-hke. But if it 
were time-hke, equation (12. 2p would contradict Corollary 12. 3. 7[ So (x y ) (y z) 
is 0. Now, since 

(x (g) y) (g) (y (g) z) = -((x O y) o z)y 

and y is time-like, we have that (x (g y) o z = 0. Therefore, by Lemma (2.3.51 the 
vectors x, y and z are linearly dependent. So x, y and z lie in a two-dimensional 
subspace of IR"^^, and are thus hyperbolically coUinear. □ 

The next proposition relates a geodesic curve a to a parametrisation using 
Lorentz orthonormal vectors, and to a differential equation. Two vectors x and y 
in M""*"-^ are said to be Lorentz orthonormal if ||x|p = —1, x o y = 0, and ||y |P = 1. 
This definition means that x must be time-like and y space-like. Note that ei and 
e2 are Lorentz orthonormal. 

Proposition 2.3.11. Let ol : [a,b] — > if" be a curve. Then the following are 
equivalent: 

1. The curve a is a geodesic curve. 

2. There are Lorentz orthonormal vectors x, y in M""*"-^ such that 

cx.{t) = (cosh(t — a))x + (sinh(t — a))y. 

3. The curve a. satisfies the differential equation a" — ct = 0. 

Proof. If A is a Lorentzian matrix then [Acx.)' = Aa', so a satisfies the differential 
equation if and only if Aa does. We may thus transform a by a Lorentzian matrix. 

Suppose CK is a geodesic curve, and t G [a,b]. We prove the second statement 
holds. Now, 

r]{cx{a), a(6)) = b — a 

= {t-a) + {b-t) 

= r]{oL{a), a{t)) + r]{a{t), a{b)). 

By Lemma 12.3.101 a{a), a{t) and a{b) are hyperbolically coUinear. So Q;([a,6]) 
is contained in a hyperbolic line of if". Thus we may assume that n = 1, and 
so consider as the branch of the hyperbola a;^ = 1 + on which xi > 0. By 
Proposition I2.3.2| we may apply a Lorentz transformation to map a (a) to ei, and 
so we may assume that a{a) = Si. Then, 

ei ■ ctit) = (— Q:(a)) o ait) = cosh?7(Q:(a), cx{t)) = cosh(t — a). 

From the equation xf = 1 + x^, we have e2 ■ oc{t) = ±sinh(t — a). Since ex. is 
continuous, either e2 ■ ct{t) = sinh(t — a) for all t, or 62 ■ oc{t) = — sinh(t — a) for all 
t. In the second case, we may reflect in the X2-axis, and so we may assume that a 
has the form 

a{t) = (cosh(t — a))ei + (sinh(t — a))e2. 
Now suppose there are Lorentz orthonormal vectors x and y such that 

a{t) = (cosh(t — a))x + (sinh(t — a))y. 
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Let s and t be such that a < s < t < b. Then 



cosh?7(a(s), ct{t)) 



— a(s) o a{t) 

— (— cosh(s — a) cosh(t — a) + sinh(s — a) sinh(t — a)) 
cosh((t — a) — {s — a)) 
cosh(t — s), 



so ri{cx{s), ait)) = t — s. Therefore ck is a geodesic curve. 

If the second statement holds then just differentiate to find that a" — a = 0. 
Finally, suppose that a" — a = 0. Then 



is a solution of this differential equation which satisfies the initial value conditions 
at if: = a. By uniqueness of such solutions, cy.{t) must have the form ( 12. 3p . We now 
just need to prove that a.{a) and ct'{a) are Lorentz orthonormal. By differentiating 
the equation a{t) o ct{t) = —1 we obtain a.{t) o (x'{t) = for all t. In particular, 
a(a) o oc'{a) = 0. Also, 



\\a{t)f = a{t) o a{t) = cosh^(t - a)\\a{a)f + sinh^(t - a)\\a'{a)\\'^. 
Since a maps into H^, \\a{a)\\'^ = ||a(t)|p = —1, and so ||Q;'(a)|p = 1. This 



The next proposition uses these results to characterise the geodesic lines of H^. 
Proposition 2.3.12. A function A : M — if" is a geodesic line if and only if there 
are Lorentz orthonormal vectors x and y such that 



Proof. Suppose such vectors x and y exist. Then A satisfies the differential equation 
A" — A = 0. By Proposition 12.3. llj the restriction of A to any interval [a, b] is a 
geodesic curve, and so A is a geodesic line. 

For the converse, suppose A is a geodesic line. Then A" — A = so, as in the 
proof of Proposition 12.3.111 

A(t) = (cosht)A(O) + (sinht)A'(O). 

The vectors A(0) and A'(0) are Lorentz orthonormal by the same argument as in 
the proof of Proposition 12.3.111 □ 

Corollary 2.3.13. The geodesies of H"^ are its hyperbolic lines. 

Proof. Suppose A is a geodesic line of if". By Proposition l2.3.1^ there exist vectors 
X and y, one of which is time-like, such that 



a{t) = (cosh(t — a))a.{a) + (sinh(t — a))a'{a) 



(2.3) 



completes the proof. 



□ 



A(t) = (coshi(:)x+ (sinht)y. 



A(t) = (cosht)x+ (sinht)y. 
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Then the image of A is the intersection of and spanjx, y}, which is a hyperbohc 
hne. 

Conversely, let L be a hyperbolic line of H^. As in the proof of Proposi- 
tion 12.3.111 we may assume that n = 1. Then L = H^. Define A : M ^ 
by 

A(t) = (cosht)ei + (sinht)e2. 

Then by Proposition 12.3. 12[ A is a geodesic line which is onto = L. So L is a 
geodesic. □ 

Hyperbolic trigonometry 

We now restrict our attention to the hyperbolic plane H^, and discuss hyperbolic 
trigonometry. For reasons of space, the treatment here is cursory; again, see [21] for 
the details. In particular, the area of regions in the hyperbolic plane is not needed 
at all in later chapters, so we do not define area. 

The definition of a hyperbolic triangle is as follows. Suppose x, y and z in 
are hyperbolically non-collinear. Let L(x, y) be the unique hyperbolic line of 
containing x and y, and let i^(x, y, z) be the closed half-plane of with z in its 
interior and L(x, y) as its boundary. The hyperbolic triangle with vertices at x, y 
and z is defined to be 

T(x, y, z) = i7(x, y, z) n H{y, z, x) n H{z, x, y). 

The sides of this triangle are the geodesic segments [x, y], [y, z] and [z, x]. To define 
the angles of this triangle, let a = (i//(y,z), b = (i//(z,x) and c = (i/^(x, y). Let 
f : [0, a] H^, g : [0, b] and h : [0, c] be the geodesic curves joining y 

to z, z to X, and x to y respectively. Then, the hyperbolic angle a at the vertex x 
is defined to be the Lorentzian angle between the vectors —g'{b) and h'(0). Angles 
f3 and 7 at vertices y and z are defined similarly. The following sketch shows the 
relationships between vectors, sides and angles. 



y 




The trigonometric identities below may be proved using Lemma [2.3.51 and Corol- 
laries [23SH2]3]8l In each of these identities, a, (3 and 7 are the angles of a hyperbolic 
triangle, and a, b and c are the lengths of the opposite sides. 
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Theorem 2.3.14 (Sine Rule). 



sinh a 



sinh b 



sinh c 



sma 



sin (3 



sin7 



Theorem 2.3.15 (First Cosine Rule). 



cosh a cosh b — cosh c 



cos 7 = 



sinh a sinh b 



Theorem 2.3.16 (Second Cosine Rule). 



cosh c = 



cos a cos (3 + cos 7 



sin q; sin (3 



We now generahse the definition of hyperbolic triangle to include the possibility 
of vertices at infinity. Suppose Li and L2 are hyperbolic lines, with Li the intersec- 
tion of with the plane spanjxi, yi}, and L2 the intersection of with the plane 
span{x2, y2}. If spanjxi, yi} fl span{x2, y2} is a one-dimensional subspace of ]R"+^ 
consisting of light-like vectors, then Li and L2 are said to meet at infinity. Hyper- 
bolic lines which meet at infinity are disjoint, but become asymptotically close in 
one direction. A generalised hyperbolic triangle, then, is defined in the same way as 
a hyperbolic triangle, except that the hyperbolic lines defining adjacent sides may 
meet at infinity. The angle at a vertex at infinity is defined to be zero. 

The Second Cosine Rule may be extended to a generalised hyperbolic triangle 
with one vertex at infinity. If such a triangle has angles a, /? = | and 7 = 0, and 
finite side of length c, we obtain 



This shows that there is an absolute unit of length in hyperbolic geometry, as the 
length c depends only on the angle a. 

Theorem 2.3.17. Let T be a generalised hyperbolic triangle. If the angles ofT are 
a, (5 and 7, then the area ofT isTT — a — /3 — j. 

Corollary 2.3.18. The angle sum of a generalised hyperbolic triangle is less than tt. 
2.4 The projective disc model 

The projective disc model is mapped to iJ" by a projection known as gnomonic 
projection. Its chief feature is that its hyperbolic lines are the same as its Euclidean 
lines. 

The open unit disc in R" is the set = {x e K" : |x| < 1}. Identify K" with 
X {0} in ]R"+\ Let fi be the translation of vertically by e„+i, followed by 

radial projection to iJ". The map is known as gnomonic projection, is a bijection, 

and has the explicit formula 



c = cosh 




X + e. 



•n+l 



X + e„+i 
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We now define a distance function do on by 



c?D(x,y) = (ij^(/i(x),/i(y)). 



Tlie projective disc model of hyperbolic n-space is the metric space consisting of 
D"' together with the distance function do- 

A subset L of D"- is called a hyperbolic line of if fi{L) is a hyperbolic line of 
H^. Since fi : D"' is an isometry, and the geodesies of iJ" are its hyperbolic 

lines, the geodesies of D"- are its hyperbolic lines. The following characterisation of 
the hyperbolic lines of shows that in this model, hyperbolic lines are the same 
as Euclidean lines. 

Proposition 2.4.1. A subset L of D"^ is a hyperbolic line of if and only if L is 
an open chord of D"^ . 

Proof. Let L be a hyperbolic line of H^. Then L is the intersection of with the 
plane P(u, 0), where u is some unit vector in M^. Radial projection maps L onto 
the line of intersection of the plane P(u, 0) and the plane P(e3, 1). The translation 
by the vector —63 which follows ensures that /i~^(L) is an open chord of D^. This 
argument can be reversed for the converse. □ 

2.5 The conformal ball model 

The conformal ball model is mapped to iJ" by a projection known as stereographic 
projection. We show that the isometries of this model are Mobius transformations, 
and that the hyperbolic angles of this model are the same as Euclidean angles. 
First, we redefine the Lorentzian inner product on M""*"^ to be 



All results of Section 12.31 are still true, once the order of coordinates in M""*"^ is 
reversed. Identify M'^ with M" x {0} in ]R"+^ Let B"" be the open unit ball in (this 
is the same set as D^, but we will soon be defining a different distance function for 
B^, and so wish to distinguish the two metric spaces). The stereographic projection 
C of -B" onto if" is the projection of x G -B" away from the vector — e„+i until it 
meets if" in the unique point ({'x). The explicit formula for this map is 



The projection is a bijection from i?" to if". 

The conformal ball model of hyperbolic n-space is defined to be the set B 
together with the distance function ds given by 



X O y = Xiyi + X2y2 H h - Xn+iyn+l- 




(2.4) 



c?B(x,y) = rfH(C(x),C(y))- 
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By the explicit formula for dn, and (12.41) . 



cosh(iH(C(x),C(y)) = -C(x)oC(y) 



-4x • y + 


(1 + 


X 


m + \ 


|y| 




(1- 


X 






|y| 


P) 



= 1 

Thus an explicit formula for is 



2 


|x-y| 


2 


(1-|X|2)(1- 




|y| 


P) 



2 


|x-y| 


2 


(1-|X|2)(1- 




|y| 


P) 



The topology induced on B'^ hj cIb is the same as the subspace topology of 5" as 
a subspace of M". 

A subset L of i?" is defined to be a hyperbolic line of -B" if C(-^) is a hyperbolic 
line of H^. By Corollary 12.3.131 and the fact that ( is an isometry, the geodesies of 
5" are its hyperbolic lines. We now characterise the hyperbolic lines of B^. 

Proposition 2.5.1. A subset L is a hyperbolic line of B^ if and only if L is either 
an open diameter of B"^, or the intersection of B"^ with a circle orthogonal to S^. 

Proof. Let L be an open diameter of B^. Then L is the intersection of B^ with 
the one-dimensional subspace of spanned by, say, the vector v. The projection 
( maps L onto the hyperbolic line of obtained by intersecting with the 
two-dimensional subspace spanjv, 63}. Thus L is a hyperbolic line of B^. This 
argument can be reversed to show that if C(-^) is a hyperbolic line of which 
passes through the point 63, then L is an open diameter of B^. 

Now, if X = {xi,X2,X3) G then xf + X2 — x^ = —1. The inverse of the 
projection ( is thus the map 

C^{xi,X2,X3) = \—{xi,X2). 

Let i? be a rotation of M.^ about the 2:3 axis, and identify R with a rotation of 
M^. Then (~^{R{x.)) = i?((^^^(x)) for any x G H^. So we may consider, without 
loss of generality, a hyperbolic line of which is of the form fl P(w, 0), where 
w = (-sin ^,0,cos^) for some ^ G (f,f). Let L = C"H^^ H P(w, 0)). Then L is 
a hyperbolic line of B^. Now, x G -P(w, 0) if and only if 0:3 = xi tan^. This means 
that X G -ff^ n P(w, 0) if and only if 

2 2 i 2 /) 2 1 
X2 = Xi tan u ~ Xi — I. 

Abbreviate tan^^ to t. Then 

C-(x) = ^(-1,-2) = ^ (x,, ^xjt^ . 
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The circle in with centre {t, 0) and radius ^/t"^ — 1 meets and is orthogonal 
to it. See the diagram in the proof of Lemma [2.5.21 below. We have 

So L, the image of fl P(w, 0) under lies in the circle centre (t, 0) and radius 
yW—l. Since hyperbolic lines of are connected, and is continuous, L is 
a connected arc. As Xi — oo, the arc L approaches the boundary of B^. Thus 
L is the intersection of with a circle orthogonal to S^. This argument can be 
reversed to prove the converse. □ 

2.5.1 Mohius transformations and isometries 

A map (f) : E"' — > E"' is said to leave -B" invariant if (p maps -B" bijectively onto itself. 
(If, in addition, 0(x) = x for all x in 5", we say that (p fixes each point of 5".) We 
define a Mobius transformation of 5" to be a Mobius transformation of E"- which 
leaves B"^ invariant. Our aim is to show that I{B'^), the group of isometries of -B", 
is isomorphic to M{B'^), the group of Mobius transformations of B^. 

The following lemma gives sufficient and necessary conditions for a refiection 
in a sphere to leave -B" invariant. Two spheres are said to be orthogonal if they 
intersect in i?", and at each point of intersection their normal lines are orthogonal. 
Lemma 2.5.2. Let a he the reflection of E^ in the sphere S'(a, r). Then a leaves 
B'^ invariant if and only if S{a,r) is orthogonal to S'^~'^ . 

Proof. By considering the intersection of S'(a, r), i?" and 5*"""^ with two-dimensional 
subspaces of E"' which contain the vector a, we may assume that n = 2. The circles 
S{sL,r) and 5*^ are orthogonal if and only if, at their points of intersection, their 
respective radii are orthogonal. Thus, ^(a, r) is orthogonal to if and only if 
|aP = + 1. 
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Suppose S{a.,r) is orthogonal to S^. Then for x e B^, 

k(x)r = 



a + r 



X — a 



a + 



|x — a|' 

(|aP-l) 
|x — ap 



(x-a) 



X — a 


^ + (1- 


X 




a 






X — 


a 


2 



<1, 

since |x| < 1 and |a| > 1. Thus o^B"^) is contained in 5^. To show that a is 
onto S^, let y be in B'^. Then, since a is its own inverse, (7(x) = y if and only if 
X = (T(y). But (7 maps B"^ into S^, so x e S^, hence cr is onto. Therefore cr leaves 
B^ invariant. 

For the converse, we identify B"^ with the open unit disc in C and 5*^ with the 
unit circle in C. Without loss of generahty, we may consider the centre of the 
circle 5" (a, r) to lie on the real axis at a point a > 0. We prove the contrapositive. 
Suppose S{a,r) is not orthogonal to S^. We show that there exists a point of B"^ 
which is mapped outside B"^ hj a. If a < 1, let (5 > be such that a + 5 < 1. Then 
a + 5 & B^, but for sufficiently small 5, 

a(a + 6) = a + — > 1. 



If a > 1 and < + 1, then for small enough 5 > 0, we have + 6{a + l) < + 1 
and 1 — S e B'^. However, 



[1-5) 



< -1. 



Finally, if a > 1 and > 1 + r^, then choose 5 > so that — 6{a — 1) > 1 + 
and —1 + 6e B^. We have a(— 1 + 6) > 1. So, if S{a,r) is not orthogonal to S^, 
the reflection a does not leave B'^ invariant. □ 

Lemma 2.5.3. If (p is a Mobius transformation of leaving B"^ invariant, and 
X, y are in B^ , then 





x-y 


2 


(l-|x|2)(l 




y 





Proof. Since is a composition of reflections, we may assume that is a reflection. 
If </) is a reflection in a plane then, in order that leave i?" invariant, must 
be a reflection in a plane passing through the origin. Then (f) preserves Euclidean 
distances, and 0(0) = 0, so |0(x)| = |x|, |0(y)| = |y| and |0(x) — 0(y)| = |x — y|. 
The result follows immediately. 
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We are left with the possibihty that is a reflection in a sphere ^(a, r). By 
Lemma I2.2.7[ 

|(/>(x)-0(y)| _ 

|x — y| |x — a| |y — a| 

By Lemma r2.5.2[ ^(a, r) must be orthogonal to S*""^, and from its proof, this means 
that |ap = + L The formula for the reflection is 

(/)(x) = a + -(x - a), 

|x — a|"^ 

hence 

|0(x)|2-l = |ap-l + - ^a-(x-a) + r 

|x — a|"^ |x — a|"^ 



^2 


X — a 


P + 


2r2a- 


(x — a) + r"^ 




X — a 


2 



_ r2(|x|2 - 1) 
|x — ap 

Putting all this together, we obtain 

l0(x)-0(y)P ^ ^ (l-|0(x)p)(l-|0(y)p) _ 

|x — yP |x — aP|y — aP (1 — |xp)(l — |yp) 

Rearrange this equation to finish the proof. □ 

Theorem 2.5.4. Every Mohius transformation of which leaves 5" invariant 
restricts to an isometry of , and every isometry of B"^ extends to a unique Mohius 
transformation of E"" which leaves 5" invariant. 

Proof. The claim that Mobius transformations restrict to isometrics follows imme- 
diately from the explicit formula for ds together with Lemma [2.5.31 

Conversely, let : — >■ be an isometry. We construct an extension for as 
follows. First, for any b G -B", we construct a Mobius transformation Tb such that 
Tb leaves B^ invariant, and Tb(0) = b. Next, we consider the map t^(o)0; which 
fixes 0, and show that this map is orthogonal. It follows that t^(o)0 extends to a 

Mobius transformation of We then prove that this extension is unique. 

Let S{a,r) be a sphere of E" orthogonal to S"""^, and let be the reflection 
in this sphere (since = |ap — 1, the radius r is a function of a). Let pa be 
the reflection in the hyperplane through normal to a. Then by Lemma I2.5.2[ 
(Ta and Pa leave B"- invariant, so the composition PaO'a leaves invariant. A 
calculation shows that PaC"a(0) = — a/|ap. For b 7^ in 5", set b' = — b/|bp. If 
r = (|b'p — 1)-^/^ then S{h',r) is orthogonal to S*""^, so we may deflne a Mobius 
transformation of i?" by the formula Tb = ph'O'h'- Deflne tq to be the identity map. 
Then for all b in 5", rb(0) = b. 
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We now define a map i}) : -B" — > 5" by ■?/' = t^(o)<^. We have '?/'(0) = 0. 
And, since t/^ is a composition of an isometry and a Mobius transformation, by the 
first part of this theorem, if) is an isometry of B^. Let x and y be in B^. Since 
c?B('i/'(x), 0) = ^^(x, 0), we have 

l^(x)P _ |xp 

l-|^/'(x)|2 l-|x|2' 

hence iV'lx)! = |x|. Also, since ^^(^/'(x), ^/'(y)) = ^^(x, y), we have 

|^(x)-^(y)P _ |x-yp 

(l-|^(x)P)(l-|^(y)P) (l_|x|2)(l-|y|2)' 

and so |'?/'(x) — ip{y)\ = |x — y|. Thus ip preserves Euclidean distances in B". It is 
known that any mapping ip of the open ball which preserves Euclidean distances 
and fixes the point is the restriction to 5" of an orthogonal transformation, say 
A, of E"'. Hence, t^(o)^ is an extension of (p. Now, A is a Euclidean isometry, so 
is also a Mobius transformation of Thus T(^(o)^ is a Mobius transformation of 
which leaves B"' invariant and extends 0. 

To show that this extension is unique, suppose that v is another Mobius trans- 
formation of -B" which extends 0, and let ^ be the composition v~^t^(^o)A- By the 
continuity of ^, we have ^(x) = x for all x G B". So by Lemma 12.2.111 ^ is the 
identity. Therefore T(f,(^o)A is the unique Mobius extension of (p. □ 

Corollary 2.5.5. The groups I{B'^) and M{B'^) are isomorphic. 

We can now explain why the hyperbolic angles of 5" are the same as its Eu- 
clidean angles. From the geometric definition of the map C, : -B" H"^, this projec- 
tion preserves the Euclidean angle between any two hyperbolic lines of -B" which 
intersect at the origin. By considering the tangent space to at ({0) = e„_|_i, 
it can be proved that the hyperbolic angle between two hyperbolic lines of if" in- 
tersecting at C(0) is the same as the Euclidean angle. So, the hyperbolic angle 
between two hyperbolic lines of -B" which intersect at the origin is the same as the 
Euclidean angle. Moreover, the isometrics of B"' are Mobius transformations, which 
are conformal and act transitively on the set of hyperbolic lines of 5". Therefore, 
the hyperbolic angle between any two intersecting hyperbolic lines in B"' is the same 
as the Euclidean angle between these lines. This is why i?" is called the conformal 
ball model of hyperbolic ra-space. 

2.6 The upper half-space model 

The upper half-space model is mapped to B"' by a Mobius transformation. As in the 
conformal ball model, we may identify isometrics and Mobius transformations. In 
the two-dimensional case, there are many interesting relationships between isome- 
trics, maps known as linear fractional transformations, and groups of matrices. 

Consider the upper half-space ?7" = {x G E"' : Xn > 0}. There is a standard 
transformation r] from [/" onto the open unit ball S". It is given by r) = ap, 
where p is the reflection in the boundary E""'^, and a is the reflection in the sphere 
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S{en, \/2). The upper half-space model of hyperbolic space is the set U"-' together 
with the distance function du defined by 

The topology induced on f/" by du is the subspace topology of U'" as a subspace of 
M". We will mainly be considering the upper half-plane U"^, which may be identified 
with the set of complex numbers 

{zeC: lm{z) > 0}. 

Lemma 2.6.1. The explicit formula for du is 



rfc/(x, y) = cosh ^ 
Proof. From the explicit formula for ds, we have 
cosh(i(/(x, y) = cosh ^^(//(x), //(y)) = 1 



2|ap(x)-ap(y)p 



(l-|ap(x)P)(l-|ap(y)P)- 
Then, by Lemma 12.2.71 applied to a, and p being a Euclidean isometry, we have 



|ap(x) - (Tp(y)l 



2|p(x) 


-p(y)l 




2 


X 


- y 




|p(x) - e„ 


|p(y) - e„ 




X + e„| 


|y^ 


1 



Also, 



|ap(x) 



en + 
1 + 
1 + 



2(p(x) 



|p(x) -e„|2 
4e„ ■ (p(x) - e 
|p(x) -e„|2 
4[p(x)]„ 



+ 



|p(x) 



hence 



x + 



|(Tp(x) 



|x -I- 

These results may be combined to complete the proof. 



□ 



Corollary 2.6.2. Let ia and ib, where b > a > 0, be points on the imaginary axis 
in the upper half-plane U"^ . Then the hyperbolic distance between ia and ib is log ^. 

A subset L of is, by definition, a hyperbolic line of if ri{L) is a hyperbolic 
line of -B". Thus the geodesies of are its hyperbolic lines. Now, the boundary 
of is mapped onto the boundary of by rj which is a Mobius transformation. 
So rj is conformal and maps spheres to spheres. By Proposition 12.5. a hyperbolic 
line of is the intersection of B^ with either a Euclidean line or a Euclidean circle 
which is orthogonal to its boundary S^. Hence, L is a hyperbolic line of U"^ if and 
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only if L is the intersection of f/^ with either a Euclidean line or a Euclidean circle 
which is orthogonal to the real axis. 

Another consequence of 77 being conformal is that, since i?" is a conformal model 
of hyperbolic n-space, f/" is also a conformal model. 

2. 6. 1 Mobius transformations, isometrics and linear fractional transformations 
As in the conformal ball model, we may identify the Mobius transformations and 
isometrics of f/". A Mobius transformation of f/" is a Mobius transformation of E"^ 
which maps f/" bijectively onto itself. 

Theorem 2.6.3. Every Mobius transformation of E"' which leaves U"" invariant 
restricts to an isometry ofU"', and every isometry ofU^ extends to a unique Mobius 
transformation of E"- which leaves U"' invariant. 

Proof. Follows immediately from Theorem 12.5.41 and the fact that rj is an isometry. 

□ 

Corollary 2.6.4. The groups I{U^) and M{U"') are isomorphic. 

We now consider just the two-dimensional case, where U'^ is the complex upper 
half-plane. An important class of maps on C is the linear fractional transformations. 
A linear fractional transformation is a continuous map : C ^ C of the form 

cz + a 

where a, b, c and d are in C and ad — be ^ 0. Let LF{C) be the set of all linear 
transformations of C. Let B{C) be the set of all bijections of C. Then B{C) is a 
group under composition. Note that a linear fractional transformation is a bijection, 
so LF{C) C B{C). 

We will need the definitions of several groups of matrices. First, GL{n, C) is the 
group of invertible n x n complex matrices. Then, GL{n, C) contains the subgroup 
SL{n, C), which is the group of n x n complex matrices which have determinant 1. 
The groups SL{n, M) and SL{n, Z) are defined similarly. The group SO{n, M) is the 
group of real orthogonal matrices with determinant 1. So SO{n,M.) is a subgroup 
oiSL{n,R). 

We discuss the relationships between LF(C), Mobius transformations, groups 
of matrices and the isometrics of U'^. Let S : GL{2, C) LF{<C) be the map 

b\\: az + b 
c d))^^^-^r+d- 

Lemma 2.6.5. The set LF{C) is a group under composition. 

Proof. The map S is into the group B{C). For g,h & GL{2, C), it can be checked 
that Sigh), the image of the matrix product gh, is the same map as the composition 
S((yf)H(/i). Thus, S is a group homomorphism. Therefore, the image of S is a 
subgroup of B{C). Since S maps onto LF(C), we conclude that LF{C) is a group. 

□ 
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Lemma [2.6.51 permits us to define an action of tlie group GL{2, C) on C. Let g 
be tlie matrix ^ G GL{2, C). Tlie action is defined by setting gz = {E{g)){z), 



that is, 



We now define some quotient groups. Let C* be C \ {0}. The group PGL{n, C) 
is the quotient of the group GL{n, C) by the normal subgroup {XI : A G C*}. The 
group PSL{n, C) is then the quotient 

SL{n,C)/{SL{n,C) n {A/ : A G C*}). 

The groups P5'L(r2,]R) and PSL{n,7j) are defined similarly. Then, if n is even, 
PSL{n,R) is SL{n,M.)/{±I}, and PSL{n,Z) is SL{n,Z)/ {±1}. If n is an odd 
integer, we have PSL{n,M.) = SL{n,'R) and PSL{n,7j) = SL{n,7j). The fact 
that the groups PGL{2, C) are PSL{2, C) isomorphic is used to prove the following 
corollary. 

Corollary 2.6.6. The group LF{<C) is isomorphic to the group PSL{2,C) . 
Proof. The homomorphism S : GL{2, C) —>■ LF{<C) is onto, so 

LF{C) ^ 6-^(2, C)/kerH. 
Suppose 9 = ^ is in the kernel of S. Then, for all z G C, 

az + b 

; = ^• 

cz + a 

Rearranging, we obtain the equation cz"^ + {d — a)z + b = 0. Since this equation 
holds for all z, we have c = 0, d = a and 6 = 0. Hence, the kernel of S is the 
subgroup {AJ : A G C*}. Therefore LF{C) ^ PSL{2, C). □ 

Proposition 2.6.7. Every linear fractional transformation of C is a Mobius trans- 
formation ofC. 

Proof. We first show that the group SL{2, C) is generated by matrices of the form 

- (J T) " - (-1 I 

where m is a complex number. For complex numbers m, n and p, consider the 
product 

uim)vuin)vuip)= i ^ ^^J^^ Jg^L(2,C). 

Let g = !; ) ^ SL{2,C). Suppose first that c ^ 0. Set n = c, and then choose 



c d 

m and p so that mn — 1 = a and np — 1 = d. Since both g and the product 
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u{m) V u{n) V u{p) have determinant 1, we must have b = mnp — p — m. Thus 
g = u{m) vu{n) vu{p). Now suppose that c = 0. Since det(5') = 1, a 7^ 0. We have 



f a b\ I l\ f a b\ ( —c ~d 

V ' 1 — 1 11 1 — 1 



c dj \— 1 J \c d J \a b 

which may be written u{m) v u{n) v u{p) for some complex numbers m, n and p, 
since a 7^ 0. Thus g = v u{m) v u{n) v u{p) . Therefore S'L(2,C) may be generated 
by the matrix f , and matrices of the form u{m), where m G C. 

Let 0n(m) = S(M(m)) and 0„ = Then 4>u(m)(yZ) = z + m and = — 

We have v"^ = -I, and, from Corollary ESU LF{C) = PSL{2,C). Hence LF{t) 
is generated by the maps 0u(m) and 0„. So it now suffices to prove that 0u(m) 
and 0„ are Mobius transformations. First, (t)u{m.) is a translation, and translations 
are isometrics. Therefore 0u(m) is a Mobius transformation. For 0^,, let a be the 
reflection in the unit circle S"^, and let r be the reflection in the imaginary axis, 
that is, t{z) = —z. Then 



Ta{z) 



l\ 1 



, 12 / = ^ , , 



hence (p^ = rcr. Thus 0^, is a Mobius transformation. □ 

Proposition 2.6.8. Let p : C ^ C be complex conjugation, that is, p{z) = z. Then 
M(C) = LF{C) U LF{C)p. 

Proof. Since p is a reflection (in the real axis), and Proposition 12.6.71 shows that 
LF[C) C M(C), we have the inclusion 

LF{C) U LF{t)p C M(C) 

immediately. 

For the other inclusion, let be a Mobius transformation of C. Since is a 
composition of reflections in spheres of C, we first consider the cases where is a 
single refiection. Suppose that is the refiection in the circle S{a,r). Then 



2 z — a az + r"^ — aa 

a + r -r = a + = = — = ^jplz) 

z — ar z — a z — a 



where ip is the map given by 

, , , az + ir"^ - |aR 

■^{z) = = . 

z — a 

Since 

a(-a) - (r^ - \a\'^){l) = -|ap - + jap = -r^ ^ 0, 
the map -0 is a linear fractional transformation. Hence, G LF{<C)p. 



44 



Now suppose that is the reflection in the line P{u, t). Let 0o be the reflection 
in the line through the origin P{u,0). As P{u,t) is the image of P{u,0) under 
translation by tu, we have 

^{z) = (j)o{z - tu) + tu. (2.6) 

Next, the line P{u, 0) is the image of the real axis under rotation through an angle 
of 6' + ^, where u = e*^. Since p is reflection in the real axis, 

= e'(^+i)p(e-^(^+i);2) = tmuz = -u^z. (2.7) 

Combining (12. 6p and (12.71) . we obtain 

(j){z) = —u^z + tu^u + tu = i'piz), 

where ip is the linear fractional transformation 



—u^z + tv?u + tu 

0^ + 1 ■ 



Thus e LF{t)p. 

We now consider the effect of composing reflections. Let (j) and 0' be in LF{<C). 
By expanding out, it can be shown that the composition (f)p(f)'p is in LF{C). The 
composition 00' is a linear fractional transformation, as LF{C) is a group. Also, 
both of the compositions 0p0' and 00'p are in LF{C)p. Thus every Mobius transfor- 
mation is either in LF[C), or in LF{C)p, depending on whether it contains an even 
or odd number of reflections respectively. Hence M(C) C LF{C) U LF{<C)p. □ 

The Mobius transformations belonging to LF{<C) are usually called orientation 
preserving. 

Lemma 2.6.9. A linear fractional transformation ofC leaves U"^ invariant if and 
only if there exist real numbers a, b, c, d with ad — be = 1 such that 

J , , az + b 

n^) = — — 1- 

cz + d 

Proof. Suppose there exist real numbers a, b, c, d with ad — be = 1 such that 
0(z) = {az + b)/{cz + d). Recall that is a bijection. Let z G U"^. Then 

lm(0(.)) = > 0. 

\cz + dy 

Thus maps into f/^. To show that is onto t/^, let w be any point of U'^. Then 
0(^) = w ii and only if 

b — dw 



cw — a 

Here, lm{z) = lm{w)/\cw — > 0, so is onto U^. Thus leaves f/^ invariant. 
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Conversely, suppose (f) leaves U"^ invariant. Since is a homeomorphism, (/) maps 
the boundary of f/^, which is i?^ = M U {oo}, bijectively onto itself. We know that 
for some complex numbers a, b, c and d with ad — be ^ 0, 

, , . az + b 

nz) = — — 1- 

cz + d 

We consider two cases, the first being if 6 = 0. Since ad — be ^ 0, it follows that 
a 7^ 0. Then, we may divide a, b, c and dhj a without changing the function 0, so 
we may assume that a = 1. We have 0(1) = l/{c + d) and 0(— 1) = — 1/(— c + d) in 
E^, so c + d and —c + d are real. Thus c and d are real. 

The second case is when 6 7^ 0. By dividing each of a, b, c and d by b, we may 
assume that 6 = 1. Then 0(0) = l/c? G E'^, so c? G M. If a = then c 7^ 0, and 
0(1) = l/{c + d) e E^, so c + d e R, implying c G M. If a 7^ 0, we have that 
0(-l/a) = 0, hence a G M. Then, 

implying c G M. 

Since maps the upper half-plane onto itself, lm(0(i)) > 0. We find that 
lm(0(i)) = {ad — be) / {c^ + d"^) . Therefore, the determinant ad — be must be positive. 
Let A = a /^/ ad — be, B = b/^/ad — be, C = e/yad — bc and D = d/ yad — bc. 
Then A, B, C and D are real, AD - BC = 1, and 0(z) = {Az + B)/{Cz + D). □ 

Theorem 2.6.10. Let —p be the map {—p){z) = —z. Then 

I{U^) = E{SL{2, M)) U E{SL{2, M))(-p). 

Proof. Let be in 'E{SL{2, M)). By Proposition 12.6.71 is a Mobius transformation 
of C. By Proposition I2.6.9"| leaves U"^ invariant, so is a Mobius transformation 
of U"^. Thus, by Theorem 12. 6. 3^ restricts to an isometry of U"^. Since —p is a re- 
flection, which moreover leaves f/^ invariant, 0(— p) is also a Mobius transformation 
of f/^. Hence, the composition 0(— p) is an isometry of U'^ as well. 

For the other inclusion, let be an isometry of f/^. By Theorem I2.6.3[ 
extends to a unique Mobius transformation of f/^. So by Proposition I2.6.8| either 
G LF{C), or G LF{t)p. 

If G LF{<C), then by Lemma [2.6.91 since leaves f/^ invariant, is the image 
of some matrix in SL{2,'R) under the map S. If G LF{<C)p, let r be the linear 
fractional transformation t{z) = {—z + {))/{Qz + 1) = —z. Then {~p){z) = Tp{z), 
and so LF{C)p = LF{C)tp = LF{C){—p). Now, the map — p leaves f/^ invariant, 
and leaves f/^ invariant, so has the form 

a(-^) + 6 
c(— z) -I- a 

where a, b, e and d are real and ad — be = 1. Hence G S(S'L(2, ]R))(— p). □ 
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2.6.2 Topological groups and discrete subgroups 

A topological group is a group G which is also a topological space, such that the 
multiplication {g, h) i— >■ gh and inversion g i— > g^^ are continuous functions in G. 
Many of the groups discussed in this chapter are topological groups. Groups of 
matrices such as GL{n, C) and SL{n, M) are topological groups, and their topology 
is the metric topology induced by the distance function 

d{A,B) = \A-B\, 

where the right-hand side is the matrix norm defined by 

1/2 

\A\ 

\i,j = l 

Then, quotient groups such as PSL{2,M.) are topological groups as well, equipped 
with the quotient topology. The group of isometries of or f/", and the group 
of linear fractional transformations of C, are also topological groups. They are 
topologised with the subspace topology inherited from the spaces C{B"-), C{U^) or 
C(C), respectively, of continuous maps with the compact-open topology. 

An isomorphism of topological groups G and if is a map (p : G ^ H which is 
both a group isomorphism and a homeomorphism. The isomorphism of LF{C) and 
PSL{2, C) established in Proposition 12.6.61 is an isomorphism of topological groups. 

Let r be a subgroup of a topological group G. Then F is said to be a discrete 
subgroup of G if for all 7 G F there exists an open neighbourhood of 7 in G such 
that ^] n F = {7}. Recall that the groups SL{2,R) and PSL{2,R) act on by 
isometries. We now show that SL{n,Z) is a discrete subgroup of SL{n,M.), and 
that PSL{n,Z) is a discrete subgroup of PS'L(n,]R). 

Lemma 2.6.11. Let G be a topological group which is also a metric space, and 
let T be a subgroup of G. Suppose every convergent subsequence in F is eventually 
constant. Then F is discrete. 

Proof. Suppose that every convergent subsequence in F is eventually constant, but 
that F is not discrete. Then there is a point 7 G F such that for all n > 1, the ball 
5(7, l/n) contains a point 7^ of F which is different from 7. Then 7„ 7, but the 
sequence {'^n\^=i is not eventually constant, which is a contradiction. We conclude 
that F must be discrete. □ 

Proposition 2.6.12. A subgroup F of SL{n,M.) is discrete if for all r > the set 

{A e T : \A\ < r} is finite. 

Proof. Suppose that {A E T : \A\ < r} is finite for each r > 0. Let Bj ^ i? in F. 
As the norm function is continuous, \Bj\ — > \B\. Thus, there exists an integer jo 
such that for all j > jo, \\Bj \ - < 1. If \\Bj\ - < 1 then \Bj\ < 1 + 
Fix r = 1 + \B\, then the set {A e T : \A\ < 1 + \B\} is finite. Hence, the 
sequence {Bj} is eventually constant. Since S'L(n, M) is a metric space, we may 
apply Lemma [2.6. Ill to conclude that F is discrete. □ 

Corollary 2.6.13. The group SL{n,7j) is a discrete subgroup of SL{n,M.) . 
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Corollary 2.6.14. The group PSL{n,Z,) is a discrete subgroup of PSL^UjE.) . 

We now discuss group actions in greater detail. Let G be a group acting on a 
set X and let x be an element of X. The subset of X given by Gx = {gx : g E G} 
is called the G-orbit of x, or the orbit of x under G. The G-orbits partition X. 
Lemma 2.6.15. If F is a discrete group of isometrics of a metric space X, and K 
is compact in X, then K contains only finitely many points of each orbit Vx. 

Proof. Let K be compact in X. Suppose K contains infinitely many points of an 
orbit Vx. Then K contains a convergent subsequence {'^nx} of Vx, and there are 
infinitely many distinct points 7„a;. Since V is topologised with the compact open 
topology, 7„a; ^ '-jx in K if and only if 7„ — 7 in F. As F is discrete, the sequence 
{7„} is eventually constant, so the sequence {^n^} is eventually constant, that is, 
it contains only finitely many distinct elements. This is a contradiction. □ 

The stabiliser subgroup Gx of a; in G is the subgroup {g E G : gx = x}. As 
a useful application of the ideas of orbits and stabiliser subgroups, we have the 
following decomposition of 5'L(2,Z). 

Lemma 2.6.16. Let 7 e 5'L(2,Z). Then 7 may be expressed in the form kiak2, 
where ki and k2 are in S0{2,M.), and a is in SL(2,'R) with the form ( ^ g-i ) ^'^^ 



some s > 1. 



Proof. We first show that 5*0(2,1^) is the stabiliser subgroup of z — i m SL{2, '. 
Let k e 5'G(2,M). Then k has the form 



k(j 



cos 6 — sin 6 
sin 6 cos 9 



for some 9 e [0, 27r). It can be calculated that ke fixes the point i. Now suppose 

that k is the matrix in SL{2,'K), and that ki = i. Then ai + b = di — c, 

hence a = d and b = —c. Since ad — 6c = 1, we have + = 1, and so the matrix 



k 



a b\ _ fa —c 
c dl \c a 



is an element of S0{2,R). Therefore, if /c e S'L(2,M), ki — i if and only if k is in 
SO{2,R). 

We next show that for each r > 0, the orbit of the point e^i under the group 
5*0(2, M) is the Euclidean circle with Cartesian equation 

x^ + (y — coshr)^ = sinh^ r. 

We will denote this circle by Cr- When r = 0, the point e*"? is just i. Since 
50(2, M) is the stabiliser subgroup of i, the orbit is then the single point i. For 
r > 0, a tedious calculation shows that for any kg e 50(2, M), the point koe^i lies 
in the circle G^. We now argue that the orbit is onto the circle G^. The action of 
50(2, R) on the point e~''i may be regarded as a continuous function of 9. Thus, 
the image of the interval [0, 7r/2] is a connected subset of G^. When 6' = 0, we get 
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kge'^i = e^i. When 9 = 7i/2, we get kge'^i = e^^'i. For < < 7r/2, the real part of 
kge'^i is negative. Hence, the image of [0,7r/2] is those points of the circle Cr with 
non-positive real part. Similarly, the image of the interval [7r/2, vr] is those points of 
Cr with non-negative real part. Therefore, the orbit of the point e''z under S0{2, M) 
is the whole circle Cr- 

We now apply these results to complete the proof. Let p be the point 'ji. Then, 
since p lies in the orbit Cr for some r > 0, there exists a G S'0(2,]R) such that 

kp = e^i. Let a be the matrix ( ^ ^^^2 1 ■ Then ae^i = i. Let k' = ak'j. Then 

k'i = i, so k' G 50(2, M). Making 7 the subject and then relabelling, we obtain 
7 = k~^a~^k' = kiak2. □ 

2.6.3 Tesselation by fundamental domains 

In this subsection, we provide general definitions of concepts needed to describe 
tesselations of metric spaces which are induced by the action of discrete groups of 
isometrics. Most of the proofs are for just the upper half-plane. However, with a 
few exceptions, the proofs for the general case are not much different. See Chapter 
6 of [21] for a full treatment of the general case. 

Let {X, d) be a metric space and let F be a non-trivial group of isometrics of X. 
A subset i? of X is said to be a fundamental region for the group F if 

1. the set R is open in X, 

2. the members of {'jR : 7 G F} are mutually disjoint, and 

3. X = U{7;R : 7 e F}. 

A fundamental domain is a connected fundamental region. Now, if iS is a collection 
of subsets of X, then S is described as locally finite if for each point x G X, there 
is an open neighbourhood of x which meets only finitely many members of S. We 
say that a fundamental domain D is a locally finite fundamental domain if the 
collection {7-D : 7 G F} is a locally finite collection of sets. 

A subset F of X is a fundamental set for the group F if F contains exactly one 
point from each F-orbit in X. Note that a fundamental domain is not a fundamental 
set. As the orbits of F on the space X partition X, if F is a fundamental set, then 
we have X = U{7F : 7 G F}. 

We consider the metric space f/^. One feature of the upper half-plane is that 
any isometry of which fixes every point of a non-empty open subset of is the 
identity map. This fact is used to prove the following lemma. 
Lemma 2.6.17. An open subset R of the metric space U"^ is a fundamental region 
for a group F of isometrics of U"^ if there is a fundamental set F for F such that 
RCFCR. 

Proof. Suppose that there exists a fundamental set F for the group F such that 
R (1 F ^ R. We first show that the elements of the set {'jR : 7 G F} are mutually 
disjoint. Suppose that 7 and 6 are elements of F such that •yR fl SR is non-empty. 
Then, there are points z and w in R such that jz = 6w. Hence, S'^jz = w. As 
R ^ F, the points z and w are in F. Since F contains only one point from the 
orbit Tz, we have 6~^'~fz = z. Thus the isometry 6~^'~f fixes each point of the open 
set R n ■y~^6R. Therefore, ^"^7 is the identity, and so 5 = 7. 
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Next, since F C i?, we have 



X=\J^F=[j^R. 

Therefore i? is a fundamental region for F. □ 

We now describe a general method for constructing a fundamental domain. 
Suppose there is a point po in X which has a trivial stabiliser subgroup Tp^. For 
each 7 7^ 1 in F, we define the open half-space H^{po) by 

H-tipo) = {x e X : d{x,po) < d{x,-fpo)}. 

Then, the Dirichlet domain D[pq) with centre po is defined to be the intersection 
of these half-spaces: 

D{po) = f]{H,{po) : 7 7^ 1 in F}. 

If F = PSL{2,7j), we may regard F as a subgroup of the group of isometries 
of U"^. We will, abusing notation, write 7 for both a matrix 7 G SL{2, Z), and the 
coset of this matrix in PSL{2, Z). There are uncountably many points of f/^ which 
have trivial stabilisers in F. This is because, if 




then by the quadratic formula z = {{a — d) ± ^/ {d ~ a)^ + Abe) /2c, and Z is count- 
able. Some points which do have trivial stabilisers are all the points ti, where t > 1. 
We now show that in the upper half-plane, a Dirichlet domain with centre at po 
is a locally finite fundamental domain. To show that -D(po) is open, we will show 
that its complement is closed in f/^. In general, a union of closed sets need not be 
closed. However, if a collection S of closed subsets is locally finite, then the union 
of the members of S is closed. 

Proposition 2.6.18. Let F = PSL{2,7j) and let po be a point of U"^ such that Tp^ 
is trivial. Then the Dirichlet domain D{pq) is a locally finite fundamental domain 
for the action o/F on U^. 

Proof For each 7 ^ 1 in PSL{2,Z), let = \ H^{po). Then is closed. 
We show that /C = {K^ : 7 7^ 1 in F} is a locally finite collection of sets. Let z be 
any point of U^, and let r > be such that B{pQ,r) contains z. We will show that 
B{po,r) meets only finitely many of the sets in JC. If B{po,r) meets none of the 
sets in /C then we are done, so suppose B{pQ,r) intersects the set in a point w. 
Then we have 

duiPo,lPo) < du{po,w) + duiw,'jpo) < du{po,w) +du{w,po) < 2r. 

Hence, the ball B{pq, 2r) contains 7Po- Now, the closure of this ball is compact, and 
PS'L(2,Z) is discrete, so by Lemma 12.6. 15^ B{pQ,2r) contains only finitely many 
points of the orbit Fpo- Thus B{po,r) C B{pQ,2r) meets only finitely many of the 
sets K^. Therefore is a locally finite family of closed sets in U"^. This means 
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that the union of the sets in K, is closed. So D{pq), which is the complement of this 
union, is open. 

We now construct a fundamental set F and show that D{pq) C F C D{po). 
From each orbit Tz, we may select a point which is the minimum distance from the 
point Pq (if there is more than one closest point, choose any). Let F be the set of 
these points. Then F is a fundamental set for F. 

Let z be in D{pq) and 7 7^ 1 in F. Then, since 7 is an isometry, 

duiz,po) < duiz,-fpo) = dij-^z.po), 

so z is the unique closest point of the orbit Vz to Pq. Hence D{pq) C F. 

To show F C D{pq), let z be in F. If 2; = pq then z G D{pq) immedi- 
ately, so we assume that z 7^ Pq. Let 7 7^ 1 be in F. If du{z,po) > duizj'jpo), 
then du{z,po) > d{'y~^z,po), which contradicts our construction of F. Therefore, 
du{z,po) < duizjjpQ). Let [po,z] be the geodesic segment in U"^ joining the points 
Po and z, and let w he a. point in the interior of this segment. Then 

du{z,w) + du{w,po) = du{z,po), 

so 

du{w,po) = du{z,po) - du{z,w) < du{z,-fpo) - du{z,w) < du{w,jpo), 

with equality only if du{z,po) = du{z,'ypQ) and du{z,'ypo) = du{z,w) + du{w,'ypQ). 
Suppose we have equality. Then [z,w] U [w,7Po] is the geodesic segment [z, 7Po]- 
The segments [z,po] and [2;, 7Po] have the same length, and both extend [z,w], so 
Po = IPo- This contradicts the stabiliser subgroup Tp^ being trivial. Thus we have 
the strict inequality 

duiw,po) < duiw,'jpo) 

for all 7 7^ 1 in F. Hence, w G D{po). This implies that the half-open segment [po, z) 
is contained in -D(po), and so z G -D(po)- Therefore, F C D{po). By Lemma [2.6.17[ 
D{pq) is a fundamental region. 

We have shown that, for any w G D{po), the geodesic segment [po, w] is in D{pq). 
This means -D(po) is connected. Therefore -D(po) is a fundamental domain. 

Finally we show that -D(po) is a locally finite fundamental domain. Write D for 
D{po). Let r > 0, and suppose the ball B{po,r) meets 'jD for some 7 G F. Then 
there is a 2; in D such that du{po,1z) < r. Moreover, 

du{po,lPo) < du{po,'yz) + d(7z,7Po) 

< r + duiz,po) 
<r + duiz,'j~^po) 
= r + du{'Jz,po) 

< 2r. 

Since F is discrete, du{p,'-fpo) < 2r for only finitely many elements 7. Hence, the 
ball B{pq, r) meets only finitely many sets 'jD. □ 
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Let T be the generahsed hyperbohc triangle with vertices at ±i + and oo, 
and let T° be the interior of the triangle T. This triangle is pictured overleaf. 
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Proposition 2.6.19. Let po = ti for some t > 1. The Dirichlet domain D{po) is 
equal to T° . 



Proof. pLj Let m = ( ^ ^1 and v 



1 
-1 



. Then u and v are in PSL{2,Z), so 



D{po) c (HM n HM n h^-.{po)) • 

We have up^ = + 1, vpQ = ijt and m~Vo = ti — \. Thus the geodesic Re(2;) = 1/2 
is the boundary of Hu{pq), and Hu{po) = {z E : Re{z) < 1/2}. The boundary of 
Hu-^ipo) is the geodesic Re(z) = —1/2, and H^-iipo) = {z e U"^ : Re{z) > —1/2}. 
For H^[pq), all points in the intersection of the circle |z| = 1 with f/^ are the same 
hyperbolic distance from po as from vpQ. Thus, this circle, which is a geodesic since 
it is orthogonal to the real axis, is the boundary of Hy{po), and H^{pq) is the set 
{z ^ : \z\ > 1}. The three boundary geodesies form the sides of a generalised 
hyperbolic triangle with one vertex at infinity. The finite vertices are at ±| + ^i. 
Hence, Hu{po)^Hv{po)nHu-i (po) is the interior of the triangle T. Thus -D(po) ^ T°. 

Suppose D{pq) ^ T°, and write D for D{pq). Then there is a point z E T° such 
that z ^ D. Since D is a fundamental domain, f/^ = {7-D : 7 G F}, hence there is a 
7 e F, 7 7^ 1, such that z G 7-D. Now, the action of F on f/^ is by homeomorphisms, 
and D and T° are both open sets, with D ^ T°. Thus T° must properly contain 
the boundary of D. So we may, in fact, find some 7 G F such that z G 7-D. Let 
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w = '-f E D. Then, since D C T° , w is in T° . We show that this imphes a 
contradiction. 

Let 5 be 7~^, and suppose that 6 = ^ ^ . Then, since 2; is in T°, \z\ > 1 and 
Re{z) < 1/2, so 

\cz + d\'^ = c^|z|^ + 2cdRe{z) + 
> (? - \cd\ + £■ 
= (|c| - \d\f + \cd\. 

This lower bound is a non-negative integer. It is zero if and only if c = = 0; since 
ad — be = 1 this is impossible. Therefore, \cz + dp > 1. Thus, 

that is, lm{w) < lm.{z). Now replace z hy w and 5 by 7 in the above argument. 
We find that Im(7w) < lm.{w), hence lm{z) < lm{w). This is a contradiction. 
Therefore, D = T°. □ 

We conclude this discussion of fundamental domains by showing how they are 
related to tesselations, and using this relationship to find generators for the groups 
PSL{2, Z) and SL{2, Z). A convex polyhedron in a metric space X is a non-empty, 
closed, convex subset of X with finitely many sides and a non-empty interior. A 
tesselation of X is a collection V of convex polyhedra in X such that 

1. the interiors of the polyhedra in V are mutually disjoint, and 

2. the union of the polyhedra in V is equal to X. 

By comparing this definition with that of a fundamental domain, it can be seen 
that, since T° is a locally finite fundamental domain, T = {7T : 7 e P5'L(2,Z)} 
is a locally finite tesselation of U'^. Moreover, it is an exact tesselation, meaning 
that each side of a triangle 7T is a side of exactly two triangles 7T and 6T in T. 
An exact tesselation V is said to be connected if for each P,Q E V there is a finite 
sequence Po, Pi ... , Pm in V such that P = Pq, Q = Pm, and for each 1 < i < m, 
the polyhedra Pj_i and Pi have a common side. The tesselation T is pictured in 
Chapter 1 on page M 

Lemma 2.6.20. The tesselation T of f/^ is eonneeted. 

Proof. Let 5T be a triangle in T, and let V be the union of all the triangles 7T where 
there exists a finite sequence 70T, 71T, . . . , 7mT such that 5T = JqT, 7T = 7mT, 
and the triangles 7j-iT and 7iT share a common side for 1 < i < m. Since V 
contains ST, V is non-empty. The tesselation T is a locally finite collection of 
closed subsets of U"^. Hence, V is closed. 

We now show that the set V is also open. Let z be in V. If z is in the interior of 
some triangle 7T, then since an interior is open, there is an e > such that the ball 
B{z,e) is contained in the interior, and so this ball is contained in V. The other 
possibility is that z lies on the side of some triangle in V. Choose e so that B{z, s) 
meets only the triangles in T which have z on one of their sides. Let S{z, e) be the 
hyperbolic circle centred at z of radius e. Then T restricts to an exact tesselation 
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of S{z,e). This restriction is a connected tesselation, so any triangle in T which 
has 2; on a side is in V. Thus V contains B{z,s). We have shown that V is both 
open and closed, and is non-empty. Since U'^ is connected, this means V — U'^. □ 

Proposition 2.6.21. The group T = PSL{2,Z) is generated by the set 

E = {7 e r : T n 7^ is a side ofT}. 

Proof. Let 7 be any element of F. Then, since T is connected, there is a finite 
sequence of elements 70, 71, . . . , 7^ of F such that T = 70T, •jraT = jT, and ji-iT 
and ^yiT share a common side for 1 < i < m. Note that — I and 7^ = g- Since 
"Yi'-i'Ji-iT = T, the triangles T and 7~_\7jT share a common side for each i. We 
may assume that for each i, 7i_i 7j. Then 7jl\7i G S. The product 

7o(7o"Sl)(7r^72) • • • (7m-l7m) = 7 

is thus a product of the identity 70 and elements of S, hence S generates F. □ 
Corollary 2.6.22. The group SL{2,Z) is generated by the matrices u — 



and V — 
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Chapter 3 
Symmetric Spaces and Distance Functions 



3.1 Introduction 

The upper half-plane U'^ is a metric space with distance function du- The group 
PSL{2, R) acts on C/^ by isometries, and a tesselation of C/^ is induced by the action 
of the discrete subgroup PSL{2,'Z). It is natural to ask if, for 7 G PSL{2,'L) and 
z G f/^, there is any relationship between the hyperbolic distance djj{z^ 72;), and the 
number of tiles of this tesselation which are crossed by the geodesic segment [2;, 72;]. 
Now, the tiles in this tesselation of U"^ are the images under elements of PSL{2, Z) 
of the triangle T with vertices at ±^ + and 00. Moreover, the group PSL{2, Z) 
may be generated by those elements u and v which map T to a triangle adjacent 
to T. Thus, it seems there is some relationship between the number of tiles crossed 
by the geodesic segment [z, 72;], and the expression for 7 in terms of u and v. 




■* 



-2-1012 

In this chapter, we present the theory needed to more precisely frame questions 
about, and analyse, these kinds of relationships between distances, tcssclations and 
generators. The upper half-plane is an example of a very general concept called a 
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symmetric space. Where (X, d), a symmetric space, is acted on by a discrete group 
of isometries F, we define two distance functions on F, and introduce a notion of 
equivalence of distance functions. The geometric distance function on F is induced 
by the distance function d on X. The word distance function on F, an algebraic 
concept, is induced by the generators of F. The remainder of this thesis investigates 
the circumstances in which the geometric and word distance functions are equiva- 
lent. We prove that, in spaces where the tiles of the tesselation are compact, these 
two distance functions are indeed equivalent. Next, so as to generalise the action 
of PSL{2,'R.) on f/^, we describe a space P(") on which PSL{n,R), n>2, acts by 
isometries. Chapter 4 comprises a proof that the geometric and word distances on 
PSL{n, Z) are not equivalent for n = 2, but they are equivalent for all n > 3. 

3.2 Symmetric Spaces 

Let X be a differentiable manifold which is also a metric space, and let ci be a 
Riemannian distance function on X (see [8] for a discussion of Riemannian metrics 
and Riemannian distance functions). Suppose there exists a group G which acts 
transitively and by isometries on X. Also suppose there exists a special point, say 
0, in X, and an element k & G, such that k o = and Dk{0) = —I. Then, 
(X, d) is said to be a symmetric space. (The element k is actually a refiection in 
the point 0.) 

The upper half-plane is a symmetric space. Considered as an open subset of M^, 
it is a two-dimensional differentiable manifold, and it is a metric space with distance 
function du. Let G be the group SL{2,'R). We know that G acts by isometries on 
X. For transitivity, it suffices to show that for all z & U'^, there exists a. g E G so 
that gi = z. Let z = x + iy he in U'^ , and let g E G be the matrix 



Other examples of symmetric spaces are hyperbolic n-space if", Euclidean n- 
space E", and spherical n-space S"". There are also symmetric spaces of matrices, 
one of which is described in Section 13.51 below. A space which is not a symmetric 
space is an ellipsoid. There is no transitive group of isometries for this space. For 
example, reflections which are not in the axes of the ellipsoid do not preserve the 
space. 

It turns out that symmetric spaces have the features needed to construct tes- 
selations like that of the upper half-plane, as described in Subsection 12.6.31 More 
precisely, let X be a symmetric space which is acted on by a group of isometries G. 
The group G has a normal subgroup, say N, consisting of all elements of G which 
fix every point of the space X. (If X is the upper half-plane and G = SL{2, M), this 
normal subgroup is {±J}.) Then, the quotient group G/N has a discrete subgroup 
F, such that for some point po in X, the stabiliser subgroup Fp^ is trivial. The 
Dirichlet domain D with centre po is a locally finite fundamental domain for the 
action of F on X. The closure of D is a polyhedron which, acted on by F, yields a 




Then gi = z. If k is the element 
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tesselation of X. This tesselation is locally finite, and any compact set in X meets 
only finitely many tiles of the tesselation. Finally, the group F may be generated 
by those finitely many elements of F which map Z) to a tile which shares a side with 
D. For the details of the considerable amount of theory needed to establish these 
results, see Chapter 4 of [H] and Chapter 6 of [21j. 

3.3 Distance functions on discrete groups 

Let F be a discrete group of isometrics of a symmetric space X and let d and d' be 
distance functions on F. Then d and d' are said to be Lipschitz equivalent if there 
exist constants c, C > such that, for all 71, 72 G F, 

cc?(7i,72) < c^'(7i,72) < C'c^(7i,72)- 

Lipschitz equivalence is an equivalence relation. We write d ^ d' ii distance func- 
tions d and d' are in the same equivalence class. 

Lipschitz equivalence allows distance functions on F to be compared. We now 
define the two distance functions that we wish to compare. 

3. 3. 1 The geometric distance function 

Let po be a point in (X, d) such that the stabiliser subgroup Vp^ is trivial. A 
geometric distance function dR on F is defined by 

'^i?(7i,72) = c?(7iPo,72Po)- 

The letter R is used since c? is a Riemannian distance function. From the properties 
of the distance function d on X, it follows immediately that d^ is non-negative 
and symmetric, that dn satisfies the triangle inequahty, and that 71 = 72 implies 
(^i?(7i,72) = 0. Now suppose ciH(7i,72) = 0. Then (i(7ipo, 72P0) = 0, and so 
7iPo = 72^0- Therefore, po = 7r^72P0) meaning that 7]~"^72 is in the stabiliser 
subgroup of Pq. But this subgroup is trivial, hence 71 = 72- Thus d^ really is a 
distance function. 

The point po used to define the geometric distance function is not, in general, 
the only point of X whose stabiliser subgroup in F is trivial. For instance, in the 
upper half-plane, all points ti, where t > 1, have trivial stabiliser subgroups in 
PSL{2, Z). The geometric distance functions which are then defined using, say, the 
points 2i and 3i will not be the same. For our purposes, this difference does not 
matter. We are interested in Lipschitz equivalence classes of distance functions, and 
the following result shows that all geometric distance functions on F acting on the 
space (X, d) are Lipschitz equivalent. 

Lemma 3.3.1. Let T be a discrete group of isometrics of the symmetric space 
(X, d) . The geometric distance function on F depends on the choice of point in X 
with trivial stabiliser subgroup, but only up to Lipschitz equivalence. 

Proof. Let ci^ and be two geometric distance functions defined with respect to 
the points p and q in {X,d). Let 71 and 72 be elements of F. By the triangle 
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inequality, and the fact that F acts by isometries, 

(^(71,72) = rf(7iP,72P) 

< d(7ip, 71?) + d(7ig, 72?) + d(72g, 72P) 
= 2d(p,g) + 4(7i,72). 

As the group F is discrete, there is an > such that rf|.(7i,72) > for all 
distinct 71 and 72. Choose the constant Ci so that 2ci(p, q) < (Ci — l)rq. Then, for 
all 7i 7^ 72, 

2d{p, q) + 4(71, 72) < (Ci - l)r, + 4(71, 72) 

<(Ci- 1)4(71,72) + 4(71,72) 
= C'i4(7i,72)- 

Thus 4(7i'72) ^ C'i4(7i,72) for all 71, 72 G F. Similarly, there exists a constant 
C2 > such that 

4(71,72) <C24(7i, 72) 
for all 7i and 72 in F. Combining these results, 

^4(71,72) < 4(71,72) < (^24(71,72), 

which completes the proof. □ 

This lemma shows that changing the point po in the definition of a geometric 
distance function docs not affect Lipschitz equivalence. Now, the definition of 
also depends on the distance function d on. X. It is reasonable to wonder whether 
the geometric distance is well-defined, at least up to Lipschitz equivalence, if the 
distance function on the space X is changed. It turns out that, because X is a 
symmetric space, any two Riemannian distance functions d, and d' on X are scalar 
multiples of each other. Hence, the respective induced geometric distance functions 
dji and 4 indeed Lipschitz equivalent. 

We will from now on abuse notation and refer to da as the geometric distance 
function on F. 

If a; is in X and 71 in F, the distance in X along the geodesic joining x and 71a; 
is d{x,'~^ix). Since F acts transitively on X, there is a 72 G F such that 72a; = po- 
Then, 

d(x,7ix) = rf(72X, 727ix) = rf(po, 727172"%) = c?ii(l, 727172"^)- 

So the geometric distance provides a way of relating a distance in the space X 
between points x and 7a; to a distance in the group F. In particular, the distance 
along the geodesic joining the points po and 7Po is Lipschitz equivalent to o?i^(l, 7). 

3.3.2 The word distance function 

To define the second distance function on F, fix a finite set of generators for F, 
denote this set of generators by S, and let be the set : (J G S}. A 

word distance function d-z on F is induced by S as follows. For 71 and 72 in F, 
c^s(7i, 72) = ^, where n is the smallest integer such that 7]~^72 may be written as a 



58 



word of length n in terms of elements of S U E~^. Let /s(7) denote the length of a 
shortest word for 7 in terms of elements of E U E~^. Then 0^2(71,72) = ^s(7r^72)- 
The word distance function corresponds to distance in the Cayley graph for F with 
respect to E. 

We now show that (is does satisfy the definition of a distance function. Since 
the identity is the only element of F which may be written as a word of length zero, 
(is (71, 72) = if and only if 71 = 72- AH words for elements other than the identity 
have positive length, so (is(7i, 72) > for all 71, 72 G F. Also, for any 71, 72 G F, by 
taking the inverse of a shortest word for 7^ ''^72, we see that (is(7i, 72) = (is(72, 7i)- 
Finally, for any 71, 72 and 73 in F, 

= c?e(7i,72) + c^s(72,73), 

proving the triangle inequality. 

Lemma 3.3.2. The word distance function on F depends on the choice of genera- 
tors, but only up to Lipschitz equivalence. 

Proof. Let E = {ai, (T2, • • • , o"n} and E' = {(t[, (T2, • • • , cr'^} be two finite sets of 
generators for F, with (is and d'^^ the respective induced word distance functions. 
For 71,72 e F, we have o?s(7i,72) = (iE(l, 7i~S2), and (is(7i,72) = c?s(l,7rS2)- 
Thus, to prove that (is ~ d'-^, it suffices to prove that for all 7 e F, there exists a 
constant C > such that 

rfE(l,7)<C^c?'s(l,7), and d'^{l,^) < C dj^il,^). 

As E and E' are sets of generators, if (Tj G E then both (Xj and (T~^ may be 
expressed in terms of elements of E' U (E')^^, and similarly if a'j G E' then a'j and 
{o'j)~^ may be expressed in terms of elements of E U E~^. 

For all 7 e F, by taking the inverse of a shortest word for 7, we find that 
d^{ln) = d^{l,^-'), and d'^{l,^) = (i's(l,7"')- So let 

Cl = max{(is(l,(Ti)}, 

l<i<n 

02 = max {(is(l,(7')}, 

l<j<m ■' 

C — maxjci, C2}. 

Then for any 7 G F, 

cis(l,7)<ci(is(l,7)<C(is(l,7), 

and 

ciE(l,7)<C2cis(l,7)<C(i's(l,7)- 
Therefore o?s ~ (i's- □ 
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As with the geometric distance function, this result allows us to abuse notation. 
If we are interested only in Lipschitz equivalence of distance functions, we will 
now write dw for any word distance function, and refer to as the word distance 
function on F. We will continue to denote by d^ the word distance function induced 
by a particular set of generators S. 

The action of F on X induces a tesselation of X. For 7 G F, the geodesic 
segment [po;7Po] crosses a sequence of tiles in this tesselation. Let T be the tile 
which has the point po in its interior. Then [po, 7Po] intersects a finite sequence of 
tiles {7iT}"^Q, say, where 70 = 1 and 7^ = 7. There is a slight comphcation if the 
geodesic segment [po, •ypo] passes through the vertex of some tile in the tesselation. 
Since the tesselation is locally finite, if this happens we can always choose a finite 
set of successively adjacent tiles meeting at the vertex. Then, we may assume that 
for 1 < i < n, the tile 7i_iT is adjacent to the tile 7jT. Thus, the tile T is adjacent 
to the tile 7j^\7jT. As in the upper half-plane, the set of elements of F which map 
T to a tile adjacent to T generate F. This means that 

7 = (7o Wl) {11^12} ■ ■ ■ iln-lln) 

is a word for 7 with respect to a finite set of generators for F. Its length is the same 
as the number of tiles of the tesselation which are crossed by the geodesic segment 
[Po,1Po]- There is no shorter word for 7 with respect to this set of generators (this 
is proved for a case when F = PSL{2, Z) in Section 14^31 and the proof given there 
generalises). Therefore, dw{^, 7) has a geometric significance as well as its algebraic 
meaning: it is Lipschitz equivalent to the number of tiles in the tesselation which 
are crossed by the geodesic segment [po,'ypo]- 

3.4 Fundamental domains of compact closure 

A straightforward example of a symmetric space is the Euclidean plane, acted on 
by the group of Euclidean isometries. Let s and t be the orthogonal translations 
s(x) = x-|- (0, 1) and t(x) = x-|- (1, 0). Let F be the discrete subgroup of isometries 
generated by s and t, and let po be the point (1/2,1/2). Then pq has trivial 
stabiliser subgroup in F, and the Dirichlet domain with centre po is the open square 
(0, 1) X (0, 1). It seems intuitively reasonable that in the tesselation induced, which 
is a grid of unit squares, the Euchdean distance between x and 7X is roughly the 
same as the number of tiles crossed by the line segment [x, 7x] . 
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The following theorem shows that this intuition is correct. In spaces, like this one, 
where the induced tesselation has compact tiles, the geometric and word distance 
functions are Lipschitz equivalent. 

Theorem 3.4.1. [IJ^] Let D he a fundamental domain for the action ofV on (X, d). 
Suppose D has compact closure. Then the geometric distance function d^ on T is 
Lipschitz equivalent to the word distance function dw on T . 

Proof. Let S be a finite set of generators for F such that S = S^^, and let d^. be 
the word distance function induced by the set S. Let 7 be in V. Choose Cq to be 
the constant 

Co = max{(i^(l, a) : a E S}. 

We prove that (iij(l,7) < Co(is(l,7) by induction on m = (is(l,7)- For the induc- 
tive step, we may write 7 = aia2 ■ ■ ■ o'm- Then, 

7) = d{po, ai(T2 • ■ ■ CTmPo) 
= d(afVo,o-2 ■ ■ -o-mPo) 

< (i((Tf po) + d{po, (72 ■ ■ ■ amPo) 
= dnia^^, 1) + dR{l, da ■ ■ ■ a^) 

< Cod^{a^^, 1) + Cods{l, ^2 ■ ■ ■ cr„) 

< Co + Co(m - 1) 
= Com 

= Corfs(l,7)- 

Thus there exists a constant Ci > such that, for all 7 G F, dR^l, 7) < Cidw{^, 7)- 
To show that dw{^,l) < C2(ij:j(l, 7), consider the geodesic segment [po5 7Po]- 
Since F is discrete, the constant Kq defined by 

Ko = mi{d{po,epo):eer} 



61 



is strictly positive. By the division algorithm, there exists a positive integer q and 
real number r, where < r < such that 

Thus, there exists a finite sequence of points Po,Pi,P2, ■ ■ ■ ,Pn = IPo lying on the 
geodesic segment [po,7Po] such that, for 1 < i < n, 

K 

-Y ^ d{Pi-i^Pi) < Ko- 

Note that 

" K 
c?r(1,7) = c?(Po,7Po) = ^d{pi-i,pi) > n^. 

i=l 

Now, the tiles of the tesselation induced by T are the images of D under elements 
of r. So each point Pi belongs to (at least one) image of D. Thus, for < i < n, 
we have pi e 7i-D, for some 7^ G T. In particular, 70 = 1 and 7„ = 7. Then, for 
1 < i < n, define to be the element = 7j^\7i G F, so that 

rira ■ ■ ■ r„ = 7071" Si72 ■ ■ ■ 7,;-i7n = lo^ln = I7 = 7. 

Let Xi be the point 7~Vj ^ D. Since d{pi_i,pi) < Kq, 

d{li}iPi-ini-iPi) = d{xi_i,'y:[:}-^'jiXi) = d{xi_i,riXi) < Kq, 

so the point r^Xj is in the ball -B(xj_i, Kq). 
Consider the set 

B= [jBix,Ko). 

Since D is compact, there is an i?o > such that 

sup{d{x,y) : x,y E D} < Rq. (3.1) 

That is, the distance between points in D is bounded above by the constant -Rq- 
Let G be the following set of elements in F: 

e = {6 ev -.eDnB ^^}. 

Then, using (13.11) . and the fact that po is in D, 

Bc[jeDC B{po,Ko + Ro). 
6>eG 

Since the closed ball B{pq, Kq + Rq) is compact, it meets only finitely many images 
of D. Thus 6 is a finite set. Therefore, if x and y are points in D, and the point 
9y is in the ball -B(x, i^o), the element 6 belongs to the finite set 9. 
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Now, for 1 < ^ < n, the points Xi-i and Xi are in D, and the point ViXi is in 
the ball B{xi-i, Kq). This means the element belongs to the set B. Moreover, 
since rir2 ■ ■ - r^ is a word for 7, and is finite and independent of the element 7, 
the set 9 is a finite set of generators for the group T. Let be the word distance 
function induced by the set 6. Then, ^9(1, 7) < n. Thus 

c?i?(l,7) > > ^de(l,7)- 

Therefore, there is a constant C2 > such that dwi^,!) < C'2C?_r(1, 7)- □ 

Note that we did not need the fact that D is compact to prove the inequality 
(ii?(l,7) <CidH'(l,7). 

3.5 A group action of PSL{n,M.) 

As described in Subsection 12.6. 11 the group PSL{2, M) acts on the upper half-plane 
by isometrics. We now describe how the group PS'L(?7,, M), where n > 2, acts by 
isometrics on a symmetric space P'-"^ of n x n matrices. First, we define P^"\ and 
show that it is a metric space. Then, we define an action of the group SL[n, M) on 
P^"'\ and show that this action is by isometrics and is transitive. Following this, we 
conclude that P*^"-* is a symmetric space. Next, we show that the upper half-plane is 
isometric to 

p(2) 

, once the distance function on f/^ is multiplied by a constant. This 
shows that the action of SL{n, M) on P*^"-* is a logical generalisation of the action of 
5/7(2, M) on f/^. Finally, we define the action of the quotient group PSL{n,M.) on 
p(") The proofs of many statements in this section depend on elementary results 
of matrix theory. See, for example, |12j . 

A linear map T : M" M" is said to be positive definite if Tx • x > for all x 
in M", and Tx ■ x = if and only if x = 0. If T is positive definite and Tx = 0, 
then Tx ■ x = 0, so x = 0. Thus positive definite matrices are invertible. 

Let P*^"^ be the set of symmetric, positive definite n x n real matrices of deter- 
minant 1. In other words, P^"-* is the subset of symmetric positive definite matrices 
in S'T(n, M). Note that the identity matrix I is in P("). The space P^'^' is a differ- 
entiable manifold of dimension ^{n + 1). 

We now show that P*^") is a metric space. From now on, we denote the Euclidean 
norm of a vector x G M" by ||x||. The operator norm of a matrix A is then, by 
definition, 

\\A\\ = sup ||Ax||. 

||x||=l 

Note that this matrix norm is different from the norm used to equip the group 
S'L(n, M) with a topology. We define the distance function dp on P*^") by 

dp{Si, S2) = log ll^f ^Ssll + log ll^a^^Sill. 

Clearly, dp{Si, S2) = dp{S2,Si). For any two n x n matrices A and B, we have 
||AP|| < ||y4||||P|| (this is a standard result for bounded operators on a Hilbert 
space; see |4]). Since / = S'f^S'2S'^^S'i, it follows that 

1 = ||/|| < ||5f^52||||52"^5i||. 
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Thus 

dp(5i,52) = log||5ri52||||52-^Si|| > 0. 
If Si = S2 then dp{Si, S2) = log = log 1 = 0. If dp{Si, S2) = 0, then 

Write M = Sf^Sa, so that ||M||||M-^|| = 1. Since M is in SL{n,R), there are 
matrices Ki and K2 in SO{n, M), and a diagonal matrix A G SL[n, M) with positive 
diagonal entries, such that M = K1AK2. (This decomposition is established for the 
case n = 2 in Lemma 12.6. 161 The general case is proved below in Lemma 13.5.21 so 
as not to interrupt the proof that P^"^ is a metric space.) Then, since H-ft'xIl = ||x|| 
for all K e SO{n, M) and all x e M", 

||M|| = \\KiAK2\\ = \\Al and ||M"^|| = WK^^A-^K^^W = \\A-^\\. 

Also, det(y4) = L So we have a diagonal matrix A with determinant 1 and 

ll^""^!! = 1. We now show that A must be the identity. 
Lemma 3.5.1. Let D be a diagonal matrix. Suppose D has positive diagonal entries 
{Ai, A2, . . . , A„}. Then the operator norm of D is 



\D\\ = maxjAjj. 

l<j<n 



Proof. We have 



= sup ||-Dx|p = sup{A^x^ + A2X2 + ■ ■ ■ + A^x^ : xl + xl + ■ ■ ■ + x"^ = 1}. 

i|x||=l 

If A/ is the maximum of the set {Ai, A2, . . . , A„}, then for x such that ||x|| = 1, 

Xlxl + Xlxl + ■■■ + \lxl < X%xl + xl + --- + xl) = \l 
So < Xj. If x = e/, then 

Xjxl + Xlxl + ■■■ + Xlxl = Xj, 

so equality is attained. □ 

Let the diagonal entries of A be {Ai, A2, . . . , A„}. We have det(y4) = 1, so the 
product Hi Aj = 1- By Lemma 13.5.11 the equation ||y4|| = 1 means that 

(max Ai)(max A^^) = 1. Hence, 



(maxAi)(minAi)-^ = 1. (3.2) 

i i 

Suppose first that maxAj > 1. Then minA, < 1, as otherwise det(yl) = Hi Aj > 1- 
It follows that 

(max Ai)(min Aj)^^ > 1, 

i i 

which contradicts (13. 2p . Therefore maxA^ < 1. But if maxAj < 1, det(y4) < L 
Hence, maxAj = 1. For the minimum diagonal entry, we now have minAj < 1. If 
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min Aj < 1 then det{A) < 1, so min Aj = 1. Therefore, Aj = 1 for all i, and A is the 
identity matrix. 

Since A is the identity, M = K1K2 G SO{n, M). So 5f is in SO{n, R). Hence, 
5*2 = SiU for some C/ in SO{n,M.). As the matrices Si and 5*2 are symmetric, we 
have Si = {SiSfy/^ for i = 1, 2. Then, 

^2 = (525J)'/' = {SiUU^S^f' = {SiSiy = Si. 

This proves that dp{Si, S2) = if and only if = S2- 
For the triangle inequality, 

dp{Si,S,)=log\\S^'S4\\S^'Si\\ 

^logWSi's^s^'sMs.'s^s^'SiW 

<\0g\\Si'S2\\\\S2'SMS3'S2\\\\S2'S4 

= log ||>Sf^>S2||||-S2-^-Si II + log 11-52-^-531111-53-^11 
= dp{Si, S2) + dp{S2, S3). 

We conclude that P*^"^ with the distance function dp is a metric space. 

We now define an action of the group SL{n,R) on p("). For M e SL{n,R) and 
S e P("), the action of M on -5 is 

MoS ^ MSM^. 

Since S is symmetric, MSM'^ = {MSM'^Y. Since -5 is positive definite, 

MSM'^x ■ X = S{M^x) ■ (M^x) > 0, 

with equality if and only if M^x = 0. But e -5L(n,R), so M^x = if and 
only if X = 0. Hence MoS e P^"). Since ISI^ = -5, and 

(MiM2)5(MiM2)^ = Mi{M2SMl)Ml, 

this is indeed a group action. 

The stabiliser subgroup of / in SL{n, R) is 

{M e -5L(n, R) : M o 7 = 7} = {M e -5L(n, R) : MM^ = 7}, 

which is the special orthogonal group .50(71, R). This fact is used to prove the 
following decomposition of SL{n,R). 

Lemma 3.5.2. Let M e SL{n,R). Then there exist matrices Ki and K2 in 
-50(n, R); and a diagonal matrix A in SL{n,R) with positive diagonal entries, such 
that M = K1AK2. 

Proof. First, M o I — MIM'^ is a symmetric positive definite matrix. Since it 
is symmetric, there exists an orthogonal basis of eigenvectors of M7M^. If K is 
the orthogonal matrix which has these eigenvectors as columns, then the matrix 
KMIM'^K-^ = KMIM'^K^ is diagonal, and the diagonal entries of KMIM^K'^ 
are the eigenvalues of MIM^^ . By multiplying the columns of the matrix K by 
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a suitable constant, we may assume that det(i^') = 1, so that K G SO{n,M.). 
As the matrix MIM'^ is positive definite, its eigenvalues are real and positive. 
So there exists a diagonal matrix D such that KMIM^K^ = D^. Note that 
det(L>) = y/det{D^) = 1, so D G SL{n,R). Then 

D-^KMIM^K^D-^ = {D-^KM)I{D-^KMf = (D-^KM) ol = 1. 

As the group SO{n, M) is the stabiliser subgroup of I in P^"'\ we have D~^KM = K' 
for some K' G SO{n,M.). Making M the subject and then relabelling, we obtain 
M = K-^DK' = K1AK2. □ 

We now further investigate the action of SL(n,'R.) on P^^\ For M G SL(n,'R.) 
and Si,S2e P("\ 

dp{MoSi,MoS2) = \og\\S^^M-^MS2\\\\S2^M-^MSi\\ 
= log ll^f ^5*211 ||S'2"^S'i|| 
= dp{Si, S2). 

Thus S'L(n, M) acts by isometrics on p("). 

The action of S'L(?2, M) on P*^") is also transitive. Let S be in P^"). As in 
the proof of Lemma 13.5.21 since S is symmetric and positive definite, there exists 
K G SO{n, M), and a diagonal matrix D G SL{n, M) with positive diagonal entries, 
such that KSK^ = D\ Then 

S = K^DDK = {K^D)I{K^Df. 

Setting M = K'^D, we have M G SL{n, M) and M o J = 5". 

The only thing now required to conclude that P*-"-* is a symmetric space is the 
point in P*^"^ and element k G SL(n,'K) such that k fixes and Dk{0) = —I. 
For any n, we take the point to be the identity matrix. Then, for n = 2, take 

^ ~ (^^^ • ^'^^ n > 3, we may take as a matrix which has all zeros except for 

the diagonal running from bottom left to top right. On this diagonal, each entry is 
either 1 or —1, arranged so that det(A;) = 1. 

The following lemma motivates our investigation of the space P^'^\ as it shows 
that the upper half-plane and the space P*^^^ are isometric after adjusting the dis- 
tance function on f/^ (or P*^^^) by a constant. Lipschitz equivalence is not affected 
by multiplying a distance function by a constant. 

Lemma 3.5.3. There is a bijection cp : U'^ P*-^-* so that for some constant C > 0, 

du{z,w) = Cdp{(p{z),ip{w)) 
for all z,w ^ U'^. That is, (f is a similarity. 

Proof Let G = SL{2,R) and K = 50(2, M). Recall that K is the stabihser 
subgroup of i when G is acting on f/^ (see the proof of Lemma [2.6. 16p . and of / 
when G is acting on P^'^\ 
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Define a map : U"^ ^ P'-^^ by f{gi) = g o I. The domain of is U'^ because 
U"^ = Gi, and cp is onto P*^^-* since P*-^-* = G o I. To sliow Lp is injective, suppose 
(p{gii) = (p{g2i)- Then o / = o /, and so / = g2^gi o I, lience g2^gi £ K. Tlius 
g2^gii = i, and so (^li = (72^- Tlierefore 99 is a bijection. 

Now, for all gi,g2 G G, 

du{.gih g2i) = duii, gi^g2i), and dp{gi ol^g^ol) = dp{I, g^^g2 o /). 

So, to show that di/{gii,g2i) = Gdp{ip{gii),ip{g2i)), it suffices to prove that for all 
geG, 

duii^gi) = Gdp{I,go I). 

By Lemma [2.6.161 we may write g as kiak2, where ki,k2 G K and a = 

for some s > 1. Since K stabilises i in U"^ and / in P*^^\ du{i,gi) = du{i,ai) and 
dp{I,g o I) = dp{I,a o /). In f/^, ai = s'^i, and du{s'^i,i) = 2 logs. In P^'^\ 

Then, by Lemma [3.5. ![ ||ao/|| = || (ao/)~^|| = s^, so dp{ao I ^ I) = 2 log = 4 logs. 
Thus, with G = 2, du{gii, (72^) = Gdp{gi o /, (72 o /). □ 

We now, finally, define the action of the group PSL{n,R) on p("). Let M be 
in SL{n, R) and 5 in p("). Then M^M^ = (-M)5(-M)^, so M o 5 = (-M) o 5. 
Write [M] for the coset of M in the group PSL{n, M). Then [M] o = MSM'^ is a 
well-defined group action of PSL{n, M) on the space P^"'\ So, the group PSL(n, M) 
acts transitively and by isometrics on the metric space 
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Chapter 4 
Equivalence of Distance Functions on PSL{n^ Z) 



4.1 Introduction 

We have, in Chapter 3, described the action of the group PSL{n,M) on the sym- 
metric space P*^"\ and shown that this action is a generahsation of the action of the 
group PS'L(2,]R) on the upper half-plane. In Chapter 4, we consider the Lipschitz 
equivalence of distance functions on the discrete subgroup PSL{n, Z) of PSL{n, M). 
The content of this chapter is an elaboration of the paper [T3j by Lubotzky, Mozes 
and Raghunathan. 

A fundamental domain for the action of PSL(n, Z) on P^"^ does not have com- 
pact closure [10]. So Theorem 13.4. II does not apply to PSL{n^7j). Our aim in this 
chapter is to prove the following theorem. 

Theorem 4.1.1. Let d he an integer greater than or equal to 2. Let T he the group 
PSL(d,Z) acting on the space P^'^\ When d = 2, the geometric distance function 
dji on r is not Lipschitz equivalent to the word distance function dy/ on T . For all 
d>?), the distance function d^ on T is Lipschitz equivalent to the distance function 
dw on r. 

(The switch to the letter d is so that the letter n is available to be used in proofs.) 

Before proving Theorem I4.1.H we will discuss briefly how the distance func- 
tions dpt and dw are actually defined for the group PSL{d,Z). Then, we prove 
Theorem 14.1.11 for the case where F = PS'L(2,Z). To prove Theorem 14.1.11 for the 
case r = PSL{d, 7j), where > 3, it suffices to prove that there exist constants 
Ci7 C2 > such that, for all 7 G F, 

c^ij(l,7) < Cidw{^,7), and dw{l,'j) < (72^^(1, 7)- 

The proof is then broken into three main steps. 

1. The inequality (iij(l,7) < Cidw{l,'y) can be established in exactly the same 
way as in Theorem 13. 4. 1[ 

2. The element 7 G PSL{d,7j) is the coset of some matrix g G S'L((i, Z). There 
exists a constant i^" > such that dwO-^l) < -^log||5'||, where \\g\\ is the 
operator norm of the matrix g. This step is Theorem 14.4. 1[ 

3. We conclude, using the definition of the distance function dp on p("\ that 
dw{l,l)<C2dR{l,^). 

The proof of Theorem 14.4.11 forms the bulk of this chapter. 

We will often need to compare the growth of functions, and so state here a useful 
definition. Let / and g be functions on a discrete set X. We say that the function / 
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is 0{g), or f = 0{g), if there exists a constant C > 0, and a finite subset Xq C X, 
such that f{x) < Cg{x) for all a; G X \ Xq. The following lemma is also useful. 
Lemma 4.1.2. Let f and g be functions M — >• M. Then f{x) < g{x) for all x > xq 
if fi^o) < g{xo), and f'{x) < g'{x) for all x > xq. 

4.2 Distance functions on P5I/((i, Z) 

Here we discuss briefly how the geometric and word distance functions on the dis- 
crete group PSL{d, Z) are defined. 

The definition of the geometric distance function is with respect to some matrix 
5*0 e P^'''^ which has a trivial stabiliser subgroup in PSL{d, Z). Setting 5*0 to be the 
identity / does not suffice, because if K is any orthogonal matrix then K o I — I . 
However, there does exist a suitable 5*0 arbitrarily close to / in P'^'^\ For example, 
when d = 2, for small e the matrix 



has a trivial stabihser subgroup in PSL{2,Z). 

For the word distance function, it will often be convenient to consider words in 
the group SL{d^7j) rather than in PSL{d^'L). For g G SL{d,'L), write [g] for the 
coset of g in PSL{d^7j). Then if gig2 ■ ■ ■(y'n is a word for g in terms of elements of 
SL{d,Z), a word for the coset [g] is [5'i][5'2] ■ ■ ■ [gn]- Going in the other direction, if 
[9] = [9i\[92] • • • [9n] is a word in PSL{d, Z) then, in SL{d, Z), either g = gig2 • ■ ■ gn, 
or g = {—I)gig2 ■ ■ ■ gn- Multiplying by the matrix —I adds 1 to the word length, 
which will not affect Lipschitz equivalence. Thus, it is reasonable to move back and 
forth between words in SL{d,Z,) and words in PSL{d,Z,) when constructing words 
and proving results about the word distance function dw We will usually write 
7 both for the coset 7 e PSL{d,Z), and for an element g e SL(d,Z) such that 
[9] = 7) we may describe 7 as a matrix rather than a coset. 

We now describe a particular finite set "£4 of generators of PSL{d^'L). This 
set contains the (coscts of the) following matrices. First, it contains the upper- 
triangular matrices which have all diagonal entries equal to 1, and one 1 above the 
diagonal. The set also contains all permutation matrices which have determinant 
1. Finally, contains all other permutation matrices, but with 1 changed to — 1 
wherever this will ensure a determinant of So, generators for PSL{2,Z) are 
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and generators for PSL{3, Z) are 








^0 





M 





±1) 




\±l 




where the signs in the last set of matrices are arranged so that the determinant in 
each case is 1. The proof that the sets do generate the groups PSL{d, Z) is by 
induction on d. For the inductive step, it is possible by multiplying by elements 
of Erf U E^^ to transform a matrix 7 in PSL{d, Z) into a matrix 9 which has first 
column (1,0,..., 0)"^ and first row (1, 0, . . . , 0). Then, by the inductive assumption, 
the {d — 1) X {d — 1) matrix in the lower right-hand corner of 6 may be written 
in terms of elements of Erf_i U E^j*;^. The set of generators E^^.i may then be 
considered a subset of E^^, by embedding PSL{d — 1,Z) in the lower right-hand 
corner of PSL{d, Z). 

4.3 Proof of Theorem [¥TT] where T = PSL{2, Z) 

The group PSL{2, Z) may be generated by m = ^ and v = (^^i • 

E = {u, v} and let rf^ be the word distance function induced by the set of generators 
E. We show that there is no Ci > such that dw{^, m") < CidR{l, m") for all n > 2. 

Let n > 2. Then dj]{l,u"') < n. To show that dj]{l,u"') > n, we consider the 
tesselation of the upper half-plane induced by the action of F = PSL{2,Z). Recall 
that the point Pq = 2i lies in the tile T, where T is the hyperbolic triangle with 
vertices at ±| -|- and 00. Let W1W2 ■ ■ - wi be any word for -u" in terms of the 
elements of EUE~^. Let Wq be the identity in F. For 1 < j < /, let Wj be the partial 
word W1W2 ■ ■ - Wj. Then the sequence of partial words, {W,}j=0' corresponds to a 
sequence of tiles, {W)T}^-^q, such that WqT = T and WiT = u"'T. In this sequence 
of tiles, for each j such that 1 < j < /, the tiles Wj-iT and WjT share a boundary. 
For A; G Z, let Lk be the geodesic of f/^ which is given by Re(z) = k + 1/2. Then 
is a boundary between pairs of tiles. See the picture of the tesselation on page M 
Since u^po = n + 2i, the n boundaries Lq, Li, . . . , all lie between the points po 
and u"'po. Thus, the sequence {WjTYj^Q must contain at least n successive pairs 
of tiles which have a geodesic L^. as their common boundary. Therefore, I > n. So 
> n; hence ^^(l,^") = n. This implies that the word distance dw{i,u"') 
grows linearly in n. 

For the geometric distance function, we use Lemma 13.5.31 which states that the 
upper half-plane is isometric to P*^^-* once the distance function on one of these 
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spaces is multiplied by a scalar. Multiplying a distance function by a scalar does 
not affect Lipschitz equivalence, so there is some constant C > such that 



/ r?2 

dR{l,un = dp{So,u^ o ^o) = Cdu{po,uyo) = Ccosh-i M + - 

The function cosh~^(l + n^/8) is 0(log?2), so dji(l,u'^) is O(logn). But there is no 
constant Ci > such that n < Cilogn for all n > 2. Hence, there is no Ci > 
such that < Ci(ij:j(l, m") for all n > 2. So, in this case, the geometric 

distance function dji is not Lipschitz equivalent to the word distance function d^. 

4.4 Proof of Theorem [4TT] where T = PSL{d, Z) and d > 3 

The first step in this proof is to show that there exists a constant Ci > such that, 
for all 7 G r, 

(iR(l,7)<CidH/(l,7)- (4.1) 

As claimed in the introduction to this chapter, the proof in Theorem 13.4.11 suffices. 
This is because the proof of inequality (14. ip in Theorem 13.4.11 did not depend on 
the compactness of tiles in the tesselation induced by V. Indeed, inequality (14.11) 
holds in the case F = PSL[2, Z). 

The second step in the proof of Theorem 14.1.11 for the case c? > 3 is the following 
theorem. We denote by \\g\\ the operator norm of a matrix g G SL^d^Tj), that is, 
ll^fll = sup||x||=i \\g^\\- Now, if G SL{d, Z), then ||5f|| = ||— (^H. So, if 7 G PSL{d, Z) 
is the coset 7 = [g], then it is well-defined to declare the norm of 7 to be ||7|| = \\g\\. 
Theorem 4.4.1. Let d be an integer greater than or equal to 3. Then there exists 
a constant K > such that, for all •y G PSL{d,Z), dwi^,!) < -^log||7||. 

Before we prove Theorem 14.4. 1^ we show how it implies that there exists a 
constant C2 > such that, for all 7 G F, 

dw{l,l)<C2dR{l,y). (4.2) 

Inequality (14.21) is the final step in the proof of Theorem I4.1.1[ Applying Theo- 
rem U3]T] to first 7 then 7"^, we obtain 

^1^(1, 7) < Klog II7II, and (ipv-ll, 7~^) < -f^log ||7"^||. 

Now, (ivi/(l,7) = '^1^(1,7""'^), which gives us 

c?w(l,7) < yloghllh^i- (4.3) 

Next, we consider distances in the space P^'^\ The distance between the identity / 
and the matrix 7 o / is 

dp{1, 70/) = dp{1, 77^) = log 1177^11 II (77^)^' II = log 1177^11 II (7~')^7"' II • 

It is a standard result that if A is a bounded operator on a Hilbert space then 
||y4|p = ||v4*||^ = ||y4y4*||, whcrc A* is the adjoint of A; see [4j. Applying this to the 
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matrix 7, which has adjoint 7^, we obtain 



(ip(J, 7 o /) = log ||7||^||7 "^11^ = 21og II7IIII7 """ll- 
Combining this with (14 .Sp yields 

dw{l.l)<'^dp{I,^oI). (4.4) 

The geometric distance function on V is (1^(1, '~^) = (ip(5'o,7 o ^o). Since T is 
discrete, there is a constant ro > such that for all 7 G F, with 7 7^ 1, we have 
cIr^I, 7) > tq. Choose the constant C so that 2dp{I, Sq) < (C — l)ro. Then, by the 
triangle inequality, for 7 7^ 1, 

dp{1, 7 o /) < dp{I, So) + dp{So, 7 o S'o) + dp{'j o 5*0, 7 o /) 
= 2dpiI,So) + dnil,l) 

< iC-l)ro + dnil,l) 

< {C-l)dR{l,^) + dnil,7) 

Hence, using f l4.4p . we conclude that there exists a constant C2 > such that, for 
all 7 e r, 

dw{l,l)<C2dn{l,l). 



4-4- i Outline of the proof of Theorem 4-4- ^ 



Since Theorem 14.4.11 seeks to bound the word length of 7 G F, we begin by, in 
Subsection I4.4.2[ constructing a special word 

7 = 8162 ■■■5d-2. 

Each of the terms 5i in this factorisation belongs to some copy of PS'L(2,Z) in 
PSL{d, Z). We want to relate the word for 7 to the norm of 7. Since the norm of 
7 depends on the entries of 7, we show that, in fact, the elements 6i may be chosen 
so that the entries of each 5, are bounded by a fixed polynomial in the entries of 7. 

Next, in Subsection I4.4.3[ we find a bound on the word length of any 6 belonging 
to some copy of PSL{2, Z) in PSL{d, Z). Let dw be the word distance function on 
PSL{d, Z). We show that there exists a constant Ki > such that, for all such 6, 

dw{l,6)<K,\og\\6\\. (4.5) 

To establish this inequality, we first consider the elementary matrices which have 
one above-diagonal entry 1, all diagonal entries 1, and all other entries 0. We show 
that if 6 is such a matrix, then, roughly, dw{i,S^) = C'(log|n|). It is here that 

d > 3 is necessary, for if = 2 and = |^ ' then dw{^,S"') 7^ 0(log|n|). 

Next, we restrict our attention to P5'L(2,Z). We construct a special word for any 

6 G PSL{2, Z) in terms of only the matrices ^ ^ and , where n E I^. 
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Using the tesselation of the upper half-plane, we show that a particular function 
defined on the terms in this special word is bounded by the hyperbolic distance from 
Po to Spo- Then, we show that this hyperbolic distance is bounded by a logarithm 
of \\S\\. Combining these results allows us to prove inequahty (14. 5p . 

In Subsection I4■4■4^ we prove an inequality relating the operator norm of a 
matrix 7 to the entry in 7 which has maximum absolute value. Subsection 14.4.51 
combines the above results to complete the proof of Theorem 14.4.11 

4-4-2 Factorisation of •y G PSL{d,Z) 

Since a word in SL{d, Z) yields a word in PSL{d, Z), the statements of Lemma [4. 4. 21 
and Corollary 14.4.31 here are in terms of the group SL{d,Z). 

First, some definitions. A vector x = {xi,X2, ■ ■ ■ , Xd)'^ in 7/ is said to be uni- 
modular if the greatest common divisor of its components, gcd(xi, 0:2, . . . , x^), is 
1. Some elements of the set of integers {xi,X2, ■ ■ ■ ,Xd} niay be zero. If, for some 

1 < J < c^; we have Xj = 0, then define 

gcd(xi, 0:2, ...,Xd)= gcd(xi, X2, . . . , Xj-i, Xj+i, ...,Xd). 

If xi = X2 = ■ ■ ■ = = 0, define gcd(xi, X2, ■ ■ ■ , Xd) = 0. Next, for 1 < i 7^ j < c?, 
we will denote by Eij{t) the elementary matrix having t at the {i,j) entry, all 
diagonal entries 1, and all other entries 0. Another definition needed is that of the 
group SL''\2,Z), where 1 < s ^ t < d. This is the copy of SL{2,Z) in SL{d,Z) 
which fixes the basis vectors for j not equal to s or t. For example, if d = 3 and 

ran au\ ^ 5L(2, Z), then 

{/ an ai2 0\ "I f /an ai2\ ] 

U21 ^22 M>, and SL^'\2,Z) = l 1 M> . 
V 1/ J { Va2i a22/ J 

Lemma 4.4.2. Let x = (xi,X2, . . . yXd)"^ ^7/ he a unimodular vector. Then there 
is an element 71 G {Eii{l) | z 7^ 1} U {/}, and elements 7^ G SL^'^{2,'L) for 

2 < i < d, such that 

7rf- ■ ■7271X = (1,0, . . . ,0)"^. 

Moreover, there is a fixed polynomial p in the components of x, such that, for 
1 < i < d, and 1 < k,l < d, the {k, I) entry of •ji satisfies 

\hi)kl\ < \p{Xi,X2,...,Xd)\. 

Proof. Since x is unimodular, x 7^ 0. If the first component Xi is non-zero then 
just put 7i = /. If Xi = 0, then xj 7^ for some 2 < j < d; put 71 = Eij{l). Then 
the first component of 71X is non-zero. 

For i = 2,3, . . . ,d in turn, we construct 7^. Let y* be the vector 

= {yi, y\,..., y^f = 7,_i ■ ■ ■ 7271X. 
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By the construction which follows, we have y\ ^ 0, and = for 1 < j < i. Let 
tti be the greatest common divisor of y\ and yl- Then, by the Euclidean algorithm, 
there exist integers u and v such that uy\ + vy] = ai. 

We claim that u and v may be chosen so that \u\ and \v\ are both less than or 
equal to the polynomial {y\y + {yf}"^- If yl = 0, then with u — 1 and v = Q the 
claim follows immediately. If \y\\ = \yl\ ^ then, again, put u = 1 and v = 0. 
Otherwise, we may suppose without loss of generality that \yl\ > \yl\. Now, since 
uyl + vyl = Oj, it follows that 

{u + myl)yl + {v - my\)y\ = 

for all m e Z as well. We may now choose m so that \u + myW < \yl\/2. Then, 
relabel u+myl as u and v — my\ as v, so that \u\ < < \y\\- Because \y\\ ^ \y\\, 

the greatest common divisor is not equal to y\ or to — yj, thus \ai\ < \y\\/2. So, 
for 1^1, we have 



Oi _ _u . 

y\ yf^ 



<TT.+Ti-^y\\< + \\y\\ < \\y\\ + = \y\\ 



a,- 


\u\ 








\yi\ 



And thus, since |m| < \yl\ and 1^1 < \y\\, it follows that \v\ < {y\Y + (2/^)^- 

Since is a factor of y\ and y^, there exist integers hi and hi such that y^^ = aihi 
and = ajfej. Put z — hi and w = —hi, then 

= -uy]^ + vyl = ttiiuhi + vbi) = ai{uz — vw). 

f XI v\ 

Therefore the matrix ( \ is in 5'L(2, Z). Then, 

w z) \yi) \-hiyi + hiyi 

Note that —biy{ + biyl = —biGibi + biUibi = 0. It follows that 

u v\ f y\ 
w z) \y\J \Q 

The integers \z\ and \w\ are bounded by the same polynomial as \u\ and \v\. We 
have 

\ai\ 

and, similarly, \'w\ < {y{y + (y-)^. 



Let 7i be the matrix ( ^ ^ ) e 'S'Li'^((i, Z). Then 



TiY' = 7i7i-i • • • 727ix = {tti, 0, . . . , 0, y]+i, . . . , y^)"^. 
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By our construction, yl — Xi, so 

ai^gcd{yl,yl) 

^gcd{gcd{y\-\yizl),Xi) 
= gcd{yl~^,Xi_i,Xi) 

= gcd(a;i,X2, . . . 
where x'l is the first component of 71X. In particular, 

Ud = gcd{x[, X2,..., Xd) = gcd(xi, X2, ...,Xd)^l. 

Thus we obtain 71, 72, . . . , 7^ such that 7^ ■ ■ ■ 7271X = (1,0,..., 0)"^. 

We have shown that the entries of 7^, for 2 < i < d, are bounded by the 
polynomial {ylf + (yi)^ = (gcd(?/r\ ytDf + xl Now, 

{gcd{y{-\ yizDf + x\ = (gcd(a;;, 0:2, . . . , Xi_i)Y + x,' 

<(K)' + x^ + --- + a;2_i)2 + ^2 

< (2(x? + x2 + ... + x2))2 + x2 
<5(x? + x^ + --- + a;^)2. 

Let p be the polynomial 

p{xi,X2, ...,Xd)= 5{xl + xl^ h xlf. 

Then p is a fixed polynomial in the components of x, and the entries of each 7j, for 
2 < i < d, are bounded by p. The matrix 71 is either the identity, or Eij{l) for 
some j, so in either case the entries of 71 are bounded by 1. The entries of 71 are 
thus bounded by the polynomial p as well. □ 

Corollary 4.4.3. Let 7 e SL{d,Z). Then 7 may be written as a product 

7 = 5i52 ■■ - Sd^, 

where each 5i belongs to some S'L'''*(2,Z) C SL[d,1j). Moreover, there is a fixed 
polynomial P in the entries of 7, such that, for I < i < d^, and for 1 < k,l < d, 
the {k, I) entry of 5i satisfies 

\{Si)ki\ < 1^(711, 7i2,---,7dd) I- 

Proof. We prove this corollary by induction on d. The idea is to multiply 7 by 
appropriate matrices on the left to obtain a matrix 7' with first column (1,0,..., 0)"^. 
Then, we multiply 7' by appropriate matrices on the right to obtain a matrix 7" 
with first column (1,0,..., 0)^ and first row (1, 0, . . . , 0). The inductive assumption 
is then apphed to 7". 

If d = 2 then we may let 61 — 7, and for 2 < i < d^, let 5i be the identity 
matrix. For d > 3, let x = (xi, ^2, . . . , Xd)'^ £ Z*^ be the first column of the matrix 
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7- Since x is a column of a matrix in SL^d^lt), x is unimodular. This is because 
the determinant of 7 may be calculated by expanding along its first column, so we 
may obtain the equation 

mixi + 1112X2 H h mdXd = 1, 

where the rrii, for 1 < i < d, are integers. Thus, by Lemma 14.4.21 there exist 
matrices 71, 72, • • • , 7d belonging to various copies of SL{2, Z) in SL{d, Z) such that 

7d- ■ ■727ix = (1,0, . . . ,0)'^. 

Hence, 

7' = 7d ■ ■ ■ 727i7 

is an element of SL{d,Z) which has as its first column (1, 0, . . . , 0)^. Also by 
Lemma 14.4.21 each of the matrices 7^ has entries bounded by the fixed polynomial 
p in the components xj, where 1 < j < (i. Since Xj = 7^1, which is an entry in the 
first column of the matrix 7, we may now consider p to be a fixed polynomial in the 
entries of 7. Therefore, each matrix 7j has entries bounded by a fixed polynomial 
p in the entries of 7. 

Let the first row of 7' be (1, 1/2, 7/3, ... , yd)- Multiply 7' on the right by 7373 ■ ■ ■ 7^, 

where for 2 < j < d, the matrix 7^- is ^ ^"'^ in SL^'^ (2,^). This results in a 

matrix 

7" = 7d ■ ■ ■ 727177273 ■■■Id 

with first column (1,0,..., 0)"^ and first row (1, 0, . . . , 0). We may thus consider 7" 
to belong to SL{d — 1, Z) embedded in the lower right-hand corner of SL{d, Z). 
The entries in the matrices of the form 7^-, for 2 < j < d, are bounded by the 

(11 1) \ 
^ \ G S'L^'-'(c/, Z), then 

(XL U \ 
^ ^ j ^ SL^'-^{d, Z), 

then yj = m(7ij +7fcj) + f^ljj- In either case, the entries in the matrices of the form 
jj, for 2 < j < d, are bounded by 

1/1 + 1 <(kl(|7i,l + l7.,l) + l^^ll7,,l)' + l 
<(p(7)(7i- + 7fc,+7j))' + l 
< (P(7)9(7))' + 1, 

where ^(7) is the fixed polynomial p in the entries of 7, and 5(7) is the polynomial 
q in the entries of 7 given by 

g(7ll, 712, • • • , Idd) = 2(7?2 + 7?3 + ■ ■ ■ + 7?d + 722 + 733 + ■ ■ ■ + Idd)- 

Thus the entries of the matrices of the form 7^, for 2 < j < d, are bounded by a 
fixed polynomial (^(7)9(7))^ + 1 in the entries of 7. 
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Applying the inductive hypothesis to the matrix 7", we may write 7" as a 
product 7" = 8182 ■ ■ ■ 5((i_i)2, where ioi 1 < k < {d — 1)^, each 8k belongs to some 
S'L'''*(2, Z), and each 8k has entries bounded by a fixed polynomial in the entries of 
the (d — 1) X (d — 1) submatrix in the lower right-hand corner of 7". So we may 
consider each 8k to have entries bounded by a fixed polynomial in the entries of 7". 
Now, 7" is the product of the matrix 7, and matrices of the form 7j and 7^-. So, 
each entry of 7" is some polynomial in the entries of 7, the 7^ and the 7^-. But the 
matrices 7, 7j and 7^- all have entries bounded by fixed polynomials in the entries of 
7. This means the entries of 7" are bounded by a (higher degree) fixed polynomial 
in the entries of 7. So we can find a fixed polynomial P in the entries of 7 which is 
sufficiently large to bound the entries of the 7^, the 7^- and the 8k- 

Rearranging to make 7 the subject, we now have 

7 = 71-^2-' ■ ■ ■7.-'5i52 ■ ■ ■ Vi)^(7^)~' ■ • ■ (73)-'(72)-'- 

The total number of matrices on the right-hand side of this expression is (P. The 
inverses 7"^ and (7j)~^ have entries bounded by the same fixed polynomial as for 

7j and 7'. This is because the inverse of 7^ = ( ^ ^ ) is ( ^ ^ ), and the 
' ' \w z) \-w u y' 

inverse of 7^- = is . In both cases, the bounds on the entries 

are unchanged. We conclude that 7 may be expressed as a product of the required 
form. □ 

Ji^.Ji^.S Factorisation of 8 E PSL{d,Z) 

As in Subsection 14.4.21 the results established here are in terms of the group 
SL{d,Z), rather than PSL{d,Z). Propositions gXl and SXIHl and LemmaSXSl 
are all used to prove Corollary 14.4. 14[ Corollary 14.4.141 says that there exists a con- 
stant Kq > such that, for all elements 8 G S'L*'*(2,Z) C SL{d,'L), we have 
dw{,^,8) < KQ\og\\8\\. The proofs of Propositions 14.4.41 and 144.101 are long, each 
involving several lemmas. 

We say that 7 G SL{d,Z) is a Ul-element of SL{d,Z), or just 7 is Ul, if there 
exists a constant > such that, for all integers n, 

c^ty(l,7")<C^log(|n| + l). 

Proposition AAA. The matrix 7 = Eij{l) (where I < i ^ j < d) is Ul-element 
ofSL{d,Z) ford>3. 

Proof. First, we show why it suffices to prove this result for just the case where 7 
is £^13(1) G 5'L(3,Z). Then, we identify a subgroup V of SL{3,M.) with the vector 
space M^, and identify a discrete subgroup of V with the lattice Z^. The matrix 
Ei3{l) is in this discrete subgroup. We construct a set S" C Z^ such that every 
point of is at a bounded distance from some point of S. For a vector v G S", we 
find a bound on the norm of v, and a bound on the word length of the element of 
SL{3, M) which corresponds to v. This allows us to bound the word length of those 
elements of SL{3, M) which are identified with the lattice Z^. To make the proof of 
this proposition more digestible, most of these steps are presented as lemmas. 
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To show why it suffices to prove this proposition for just the case where 7 is 
-£'13(1) £ SL{3,Z), let 7 and 6 be elements of SL{d,Z), and suppose that 7 is Ul. 
Then, there exist constants and Cs such that, for all non-zero integers n, 



<C^\og{\n\ + l) + 2dw{hS) 
<2{C^ + 2dw{l,S))\og{\n\ + l) 
<Cslog{\n\ + l). 



If n = then both sides are equal to 0. This shows that if 7 is Ul, so are all 
conjugates of 7. 

We now show that all elements Eij{l), where 1 < i ^ j < d, are conjugate to 
£^13(1) in 5'L((i, Z). We proceed by induction on d. li d = 3, then each matrix 
Eij{l), for 1 < i 7^ j < 3, may be written as SEis{l)5'^ for some S e SL{3,Z): 



The inductive assumption is that for some d > 3, all of the elements Eij{l), where 
< i ^ j < d, are conjugate to -£13(1) in SL{d,Z). Consider Eij{l) e SL{d + l,Z). 
If i and j are both less than or equal to d, then -Ejj(l) may be regarded as an 
element of SL{d,'L) embedded in the upper left-hand corner of SL{d+ 1,Z), so by 
assumption Eij{l) is conjugate to ii^i3(l). Otherwise, one of i or j must be equal to 
d -I- 1; we show that Eij{l) is conjugate to some matrix Est{l) with 1 < s t < d, 
hence is conjugate to £'13(1). For i — d + 1 and 1 < j < d — 1, and ior j — d + 1 
and 1 < i < d — 1, the matrix by which we conjugate is 



where /^-i is the {d — 1) x {d — 1) identity matrix. For = {d + l,d) and 

— {d,d+ 1), we conjugate by 





/ld-2 0\ 

1 

10 

\ 10 0/ 
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It remains to show that if the matrix i?i3(l) is Ul in 5*1/(3, Z), then it is Ul in 
SL{d^'L). For this, let E3 be a finite set of generators for 5'L(3, Z). By embedding 
5'L(3,Z) in the top left-hand corner of SL{d,Z), we may extend E3 to S, a finite 
set of generators for SL{d,Z). Let 7 be a Ul element of 5'L(3, Z), embedded in 
SL(d,7j). The sets of generators S3 and S induce word distance functions which 
we will denote by d^ and d^, respectively. Then, there exists a constant C-y > 
such that, for all integers n, 

dj:(l,r)<dUhr)<Cylog{\n\ + l). 

Therefore ^1^(1,7") < C^log(|n| + 1), and 7 is Ul in SL{d,Z). 

We have now shown that it suffices to prove that the matrix £'13(1) is Ul in 
5'L(3, Z). In the next part of the proof, we consider a special element and some 
subgroups of 5'L(3, Z). Let 

1 x\ 

V = { v{x,y) = I 1 y] :x,yel 
1/ 





L^VnSL{3,Z) = { 10 1 l\:k,leZ 



Multiplication in V is given by v{x, y)v{x', y') — v{x + x' ,y + y'), so V is an Abelian 
group. The group V is isomorphic to the vector space M^, where the latter is 
considered as an Abelian group under addition of vectors. We may, then, identify 
Lev with the integer lattice Z^ C M^. The action of A on y by conjugation takes 
v{x, y) to v(2x + y,x + y), which corresponds under this isomorphism to the vector 
in given by 

'2x + y\ (2 l\ (x^ 



x + y J \l 1/ 

Hence, the action of A on y by conjugation corresponds to the linear action of 
A' = j on R^. Moreover, this action preserves L, and so preserves the integer 

lattice Z^. 

We will now consider V as the vector space R^ and write the multiplication in 
it as addition. We also identify L with Z^. For v G K we will write Av meaning 
the conjugation of v by A, that is, multiplication by the matrix A', which we will 
write as A. The eigenvectors of the 2x2 matrix A are then A = (3 + \/5)/2 and 
= (3 - V5)/2. Note that 2 < A < 3 and A"^ < 1. We will denote by Vi and 
V2 fixed eigenvectors corresponding to the eigenvalues A and A~^ respectively. For 
convenience, we may choose Vi and V2 so that ||vi|| = ||v2|| = 1. We denote by Wi 

and W2 the corresponding eigenspaces of V. Let yo be the fixed vector ( J ] eV. 
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We define the following subsets of L: 

{m 

{n 
± J2 bjA~'yo ■■ bj e {0, 1, 2}, 6„ ^ 0| . 

The relationship between the sets Si and S2 and the eigenspaces Wi and W2 is 
described in the following lemma. 

Lemma 4.4.5. For k = 1,2, each point of the eigenspace is within a bounded 
distance of some point of the set Sk- 



Proof. Let 



yo 



i=l 



be a point of Si. Since Vi and V2 span R^, we have yo = avi + /3v2 for fixed real 
numbers a and /3. Then 

A'yo = aAVi + PX-W2, 

and so, 



V = ± a ^ ai AVi + ^ a^A V2 j . 

\ i=l 1=1 / 

The sum X^I^i is bounded. This is because < A and A~^ < 1, so 

1=1 



i=0 



A 



-1 ■ 



Let u belong to Wi. Then u = a'vi for some a' e R. Suppose first that a' > 0, 
and consider an element v in Si with the form 



V = q; ^ UiXWi + p'^QiX V2. 



i=l i=l 

Note here that a > 0. The distance from v to u is 



V — u 



a 



a^A^ - a' vi + i/3^ai\ M V2 



i=l 



i=l 



< a 



i=l 



+ |/3|5]a,A-\ 



1=1 



Since a and (3 are constant, and J^I^i ^j'^ * is bounded, it now suffices to prove 
that for all a > there exists an integer m > 1, and a set of integers {ai}^i, with 
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Oj G {0, 1, 2} and 7^ 0, such that 

m 



1=1 



< A. 



(4.6) 



If < a < 1 then m = 1 and ai = 1 will do. For a > 1, the proof is by induction 
on where 



<a< X^+\ 

When = 0, we have 1 < a < A, so take m = 1 and ai = 1. Now assume inductively 
that, for all / such that < / < A; and all a such that A' < a < A^"*"^, there exist 
integers with G {0, 1, 2} and a; 7^ 0, such that fl4.6l) holds. Suppose that 

X'' <a< A'=+^ Then, since 2 < A < 3, either A'^ < a < 2A^ or 2A'' < a < 3X''. Put 
Ofc = 1 in the first case and = 2 in the second case. Then 

a - akX'' < X^, 



so we have 



X <a- akX'^ < X 



i+i 



for some / < k. By the inductive assumption, there exists a set of integers {aj} 
with ttj G {0, 1, 2} and a/ 7^ 0, such that 



j=l5 



flfeA^ 



i=l 



So 



^OjA' + akX^ 



i=l 



< X. 



< A, 



and if we put = for / < i < /c, we obtain a set {aj}*Lj^, with G {0, 1, 2} and 
Ofc 7^ 0, satisfying fl4.6p . Thus ||v — u|| is bounded. 
When a' < the proof is almost the same; consider 



-a 



1=1 



i=l 



To show that every point of W2 is within a bounded distance of some point of 6*2, 
the proof is similar. □ 

Let S be the set Si + 82- Note that S* is a subset of the integer lattice L. 
Corollary 4.4.6. Each point of L is within a hounded distance of some point in S. 

Proof. The eigenvectors vi and V2 are linearly independent, so = Wi + W2- 
Each point of L, then, can be written as Wi + W2 for some Wi G Wi and W2 G H^2- 
By Lemma [4.4.5^ Wi is within a bounded distance of some point of 6*1, and W2 is 
within a bounded distance of some point of S'2. Thus wi + W2 is within a bounded 
distance of some point of Si + S'2. □ 
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We now establish a lower bound on the norm of vectors in 5*. Let v be in 5*. 
Then v may be expressed in the form 

m n 

V = ±1 ^ a,AVo ±2 h^^'yo. (4.7) 
i=i i=i 

where ±i and ±2 are independent of each other. We first prove a general result 
about linearly independent vectors. 

Lemma 4.4.7. Let wi and W2 he linearly independent vectors in the vector space 
M™, where m>2. Then there exists an M > such that, for all /ii, /i2 G M, 

||;UiWi + /i2W2|| > Mmax{||yUiWi||, ||yU2W2||}. 

Proof. The two-dimensional vector space spanned by wi and W2 may be identified 
with the complex plane C. Indeed we may, without loss of generality, identify wi 
with the point le*° = 1, and W2 with the point re*^ where r > and < 9 < n. 
Let M = sm6. Then, if fii = 0, there is nothing more to prove. Otherwise, we may 
assume without loss of generality that /ii > 0, and so we are trying to prove that 

1/^1 + l^2re^^\ > Mmax{/ii, 1/121?"} (4.8) 

for all /ii > and all fj.2 G M. Divide through this inequality by /ii and put 
R = 1^2^ I iJ^i- To establish (14. 8p . it now suffices to prove that for all i? G M, 

|1 +i?e^^| > Mmax{l, \R\}. 

Now, |1 + i?e*^|2 = 1 + 2i?cos0 + R^. We have 

{cosO + Rf > 
cos'^0 + 2Rcos9 + R^ > 

1 + 2Rcose + R'^ > sin^e. 

Also, 

{l + RcosOy > 
l + 2Rcos9 + R^cos^e > 

^ l + 2Rcos9 + R^ > R\m^9. 

It follows that 

l + 2Rcose + R^ > M^max{l,R^}. 
Take square roots of both sides to complete the proof. □ 

Note that the value of M in Lemma 14.4.71 depends only on the angle between 
the vectors wi and W2. When we use Lemma [4.4.71 in the next proof, the value of 
M will thus depend only on the angle between the fixed eigenvectors vi and V2. So 
M, as far as it is used here, is a constant. 
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Lemma 4.4.8. There exists a constant ci > such that, for any v in S, if k is the 
maximum of m and n where v has the form i4.1\) , then ciA'^ < ||v||. 

Proof. As in Lemma 14.4.51 yo = avi + f3\2 for fixed a and (3. Then, we get 

m n 

V = ±1 ^ a, (aAVi + /^A^Vs) ±2 hj (aA^^vi + (3Vv2) 



i=l 



j=l 



Let 



1^1 = (±i5^aiA*±2 5Z&iA-^ 



a, and /i2 



4 = 1 



±i^a,A-^ ±2 J]6,A^' 



By Lemma 14.4.71 there exists a constant M > such that 

||v|| > Mmax{||;UiVi||, ||yU2V2||} = Mmax{|/ii|, |;U2|}. 
Suppose first that m > n, and consider 

l/^il 



i=l 



Since 7^ 0, ^™ ^ a^A* > A*". For the other sum, we have 
It follows that 



j=0 



±i^a,A^ ±2 J]6iA- 



> 



5^6,A- 



i=l j=l 

> A™-2 

> -A". 

- 5 

Thus, ||v|| > M|yUi| > ciA"* where Ci > is constant. In the case n> m, the proof 
is similar, with a lower bound being established for \fi2\- □ 

Corollary 14.4.61 and Lemma 14.4.81 will be used in the conclusion of the proof of 
Proposition I4.4.4[ Before this, we return to the interpretation of \^ as a subgroup 
of SL(3, Z). If V e S* is considered as an element of SL{3, Z), then its word length 
is bounded as follows. 
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Lemma 4.4.9. There exists a constant C2 > such that, for any v in S, if k is 
the maximum of m and n where v has the form ([^. 7| ), then v may he written as a 
word in the elements A and Yq of SL{3, Z) of length at most C2k. 

Proof. The expression 

m n 

V = ±1 ^ aiA'yo ±2 ^ b,A~^yo 
1=1 j=i 

may be rewritten as 

V = ±1 v4(aiyo + v4(a2yo H h v4(a^„iyo + AamJo) ■■■)) 

±2 A-\biyo + A-\b2yo + ■■■ + A-\bn-iyo + A-^b^yo) ■■■))• 

Using the correspondence between and 5*1/(3, Z), we may translate this expres- 
sion for V into a word written multiphcatively in the elements 

/2 1 0\ A 1\ 

A = 1 1 and Fo = 1 1 
\0 1/ \0 1/ 

of SL{3, Z). Terms of the form a^yo have length at most 2, since < < 2. There 
are m terms of this form. Each multiplication by A in corresponds to conjugation 
by A in SL{3, Z), so we add 2 to the length for each of the m multiplications by A. 
Similar calculations for the second half of the expression show that v may be written 
as a word of length at most 2m + 2m + 2n + 2n = 4(m + n) < A{2k) = C2A;. □ 

We are now in a position to conclude the proof of Proposition I4.4.4[ Let S be 
a finite set of generators of SL{3, Z) which includes the elements A, Yq, -^13(1) and 
£'23(1), and let be the word distance function induced by the set S. Let v be 
in the set S. Using logarithm laws to rearrange the inequality ciX^ < ||v|| from 
Lemma [4.4.8[ we obtain 

^ ^ log II V II -logci 
log A 

By Lemma 14.4.91 and this inequality, the word distance function d-^ satisfies 

c?s(l, v) < C2k < -^(log ||v|| - logci) < 5 log ||v||, 
log A 

for some constant -B > 0. 

Next, by Corollary I4.4.6[ there exists some constant 61 > such that, for all 
w G L, we can find a vector v G S* so that ||w — v|| < bi. Let w be any element of 
L, and write v' = w — v, so that ||v'|| < bi. We now prove that 

f^s(l,w)<5'log(||w|| + l), 

for some constant B' > 0. Since w and v are elements of the lattice L, so is v'. We 
may then translate the vector w = v + v' G into a word, written multiphcatively, 
in terms of elements of S'L(3,Z). We use the elements A and Yq for v, and -^13(1) 
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and -£'23(1) for v'. Since the value of ||v'|| is bounded, this multiphcative expression 
for v' is bounded in length, by say 62 > 0. Thus 

ds(l,w) < ds(l,v) + ds(l,v') < Slog||v|| + 62- 
By the triangle inequality, ||v|| = ||w — v'|| < ||w|| + ||v'||, so 

Slog||v|| +62 < -Blog(||w|| + llv'll) + 62 
< Slog(||w|| +61) + 62 
<S'log(||w|| + l), 

for some constant B' > 0. 

Let n be any integer. Then the matrix (£'13(1))" is in the lattice L. It corre- 
sponds to the vector w = ( n ) ' which has norm equal to |n|. So, for some C > 0, 



^0^ 

and all integers n, 

dw{l:{E^s{l)r) < Clog{\n\ + 1). 
Thus, Ei3(l) is Ul. □ 

Proposition 4.4.10. There exist constants C3, C4 > so that any S £ S'L(2,Z) may 

be written as a word of the form d = rir2 ■ ■ ■ r^, where 

1. each Tj, for 1 < j < m, is either {^^-^ ' f^''^ some non-zero 

integer Uj, 

2. we have 

m 



where 




the point po is 2i in the upper half-plane, and du{-,-) the hyperbolic 
distance function on . 

Proof. Let T be the hyperbohc triangle with vertices at ±| + ^i and 00. Let u{ri) 

denote the matrix ^ , where n G Z, and let v be the matrix (^^^ g 

Given 6 G 5*1/(2, Z), we consider the geodesic segment [poi^Po] connecting the 
points Po and 6pQ in U"^. This geodesic segment passes through a sequence of tiles 
{^fcTj^^Q, such that 9o — 1 and 9n — S, and for 1 < A; < n, tile 6k-iT is adjacent to 
tile 9kT. Let Sk — 0^\9k- Then the product of the elements Sk is 

51-52 ■ ■ ■ Sn = Oq ^9i9i ^6*2 ■■ ■ 9^\9n = l-9n = 5. 
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Since successive tiles in the sequence {^jtTj^^Q are adjacent, we have, for 1 < /c < n, 

Sk^Ok-A e {u{l),u{-i),v}. 

Notice also that since is the identity in PSL{d,Z), there are no two consecutive 
matrices Sk-i and Sk which are both equal to v. 

We now, by reducing the product of the s^, construct a new product rir2 • • • 
which is also equal to 5, a corresponding subsequence of tiles {9jT}Y=o: and a set of 
points {pj}JLQ in U^. The basic idea is to multiply together all adjacent matrices 
of the form u{l) or u{—l). Let Sj be the first matrix in the product SiS2'''S„ 
which has the form Sj = or Si = u{—l). For all k such that 1 < k < i, let 
Tjt — Sk, and let the point Pk be Pk = O^Po- There is an integer I > such that 
Si = u{ni), Si+i = u{ni+i), Si+i = u{ni+i), but Si+i+i = v. Let n be the matrix 
Tj = u{n'j), where ni + n^+i H h rii+i. Then, 

= SjSj+i ■ ■ ■ Si+i = 6~\6i6l^6i^i ■ ■ ■ 6~^i_]6i+i = 6~\6i+i. 

Let Pi be the point 9i^ipo. Relabel as 9i, so that = and pi = 9iPo. We 

now have a product 

ri ■ ■ ■ rjSj+/+i ■ ■ ■ s„ = 5, 

a sequence of tiles {9jT}'j^Q, and a set of points {pj}*=05 such that, for I < j < i, 
Tj = 9~}^9j and pj = 9jPo. Iterate the above reduction process on the product 
Si+;+i ■ ■ ■ Sn until a product 

rir2 ■■■rm^S, 

a sequence {9jT}'JLQ and a set {pj}^o obtained, where, for 1 < j < m, rj = 9j\9j 
and Pj = 9jPQ. In this product, no two consecutive matrices rj-i and rj have the 
respective forms rj_i = u{nj^i) and r^- = ui^j), where rij^i and are non-zero 
integers. 

We now consider the path a in U'^ which is the following union of geodesic 
segments: 

a = [P0,Pl] U [pi,P2] U • • • U \pm-UPm\- 

The length of this path is 

m 

\ength(a) =^du(po,rjPo), (4.9) 

since 

m m m rn 

^ du(pj-i,Pj) = ^ du{9j^iPo, 9jPo) = ^ (i[/(po, 9j\9jPo) = ^ (i[/(po, r^Po)- 

The following lemma shows that, roughly speaking, the length of a is not too 
different from the length of the geodesic joining the points po and 5po. 
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Lemma 4.4.11. There exist constants Ki,K2 > such that 

KiduiPo,5po) < length(a) < K2du{po, 5po). 

Proof. Since the geodesic segment [poji^Po] is the shortest path from pq to 6po, we 
have immediately that 

du{po,Spo) < length(a). 

To prove the other required inequahty, for < j < m let qj be the point on the 
geodesic segment [po^^Po] which is closest to the point pj. Note that go = Po and 
Qm = Pmi and also that the point need not be in the same tile of the tesselation 
as Pj. We claim that, for 1 < j < m, 

du{po, Qj-i) < duipo, qj)- (4.10) 

This claim implies that the geodesic segments [qj^i, qj] are disjoint except for their 
endpoints, and so 

m 

duiPo, Spo) = ^ duiqj-i, qj). (4.11) 
In proving this claim, we will establish that the sequence {gj}™=o well-defined. 




\ 



Pi 



\ 




\ 




\ 




\ 
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\ 




\ 




\ 


\ 

\ 

\ 

\ 

\ / 



1 2 3 

Let L be the geodesic which contains the geodesic segment \po, Spo] 



As 9j}^ is an 



isometry, (14. lOp holds if and only if du{9j\po, Oj^^qj^i) < du{9j\po,9j\qj). Since 
is the closest point on L to Pj-i, the point 9j\qj-i is the closest point on the 
geodesic 9j}^L to Oj\pj_i = pq. Similarly, Ojliqj is the closest point on the geodesic 
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OjliL to the point OjliPj. Now, the geodesic L intersects the tiles 9jT and 9j-iT, 
and the tiles 9jT and 9j-iT are adjacent. So, the geodesic 9~}iL intersects the tiles 
9~\9jT and T, and these tiles are also adjacent. By symmetry in the imaginary 
axis, we need only consider the cases 9~\9jT = u{l)T and 9j}^9jT = vT. Thus 
the point 9j}^pj is either, respectively, 1 + 2i, or i/2. To simplify notation, we now 
write L for 9j\L, p for po, p' for 9j}^pj, q for 9j}^pj-i and q' for 9j\qj. 

The first case is when p' = l + 2i. Here, the geodesic L is a semicircle orthogonal 
to the real axis, which intersects the tiles T and u{l)T. We may parametrise z & L 
by 

z = a + re"'', 

where a G M, r > and G (0, tt). Since L meets T and u{l)T, 

r> ^~ l^ + 'Y''^ ^ r^>a^-a+l. (4.12) 

The distance from the point n + 2i, where n G Z, to a point z = a + re**^ G L is 

, 1 /_ (n — a — r cosf/))^ + (2 — r sinf/))^ 
du {n + 2i, z) = cosh-Ul + ^ ^' ^ ^' 



4r sin 

For fixed n, a and r, this function has a unique minimum when 

2(n — a)r 

cos0 = - 4.13 

(n - a)2 + r2 + 4 

Thus, the points q and q' are the unique closest points on L to p and p' respectively. 
So, in this first case, the choice of the qj is well-defined. If g = a + re*''^ and 
q' = a + re"^ , to prove fl4.10l) it now suffices to prove that 0' < 0. By (14.131) . 

-2ar 2(1 -a)r 

cos = — and cos 



a2 + r2 + 4 ^ (l-a)2 + r2 + 4 

If < a < 1, then by considering the signs of cos 0' and cos 0, it follows that 0' < 0. 
If a > 1 or a < then cos and cos 0' have the same sign. Since cosine is decreasing 
on (0,7r), it suffices to prove that cos0 < cos0'. The inequality 

-2ar 2(1 -a)r 

< 



a2 + r2 + 4 (1 - a)2 + r2 + 4 

holds if and only if a2 < a + r2 + 4. By (I4.12p . we have 

a + + 4 > a + - 1 > a^, 

and so cos0 < cos0'. This completes the proof of the case p' = 1 + 2i. 

The second case is when p' = i/2. Here, L intersects the tiles T and vT, so is 
either a vertical line, or a semicircle orthogonal to the real axis. If L is a vertical 
line, to prove (I4.1UI) it suffices to prove that Im(g') < Im(g). We may parametrise 
z&LhYz = a + iy for some a G [—1/2, 1/2]. Then the unique closest point on 
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L to p = 2i is q = a + %\J o? + 4, and the unique closest point on L to = i/2 is 
= a + i(v4c? + l/2). Therefore the are well-defined here. It is straightforward 
to prove that v4a^ + l/2 < vc?+4. If L is a semicircle, we again parametrise 
^;GLby2; = a + re*"^. Without loss of generality, a > (if a = then L cannot 
intersect both T and f T) . Again, the closest points to p and p' are unique; we find 
that the unique minimum distances from p and p' are achieved when, respectively, 

, — 2ar , — 8ar 
cosffl = — , and cosd) = — . 

So g = a + re*''^ and q' = a + re*"^'. To establish fl4.10p . it suffices to prove that 
(f)' > (f). Since coscj) < and coscj)' < 0, we wish to show that cos0 > cos0'; this is 
again a straightforward inequality. 

We have now proved ( ]4.10p . and so ( 14. lip holds. The plan for the rest of the proof 
of Lemma 14.4.111 is as follows. After considering two special cases, we show that 
when rj = uijij), there exists a constant K2 such that du{pj-i,Pj) < Kl^duiqj-i, qj)- 
This involves parametrisation of a geodesic which is a semicircle with centre a and 
radius r. The case j = 1 is treated first, then for j > 1 we find the distance between 
qj-i and qj in terms of a, r and Uj, and thus show that there exists a constant K so 
that cosh~^(l + n'j/K) < duiqj-i, qj)- From the explicit formula for du, it follows 
that du{pj^i,pj) < K'^du^qj-i^qj)- Then, we consider the Vj which are equal to v. 
We show that 

du{Pj~i,Pj) +duiPj,Pj+i) < 4:K2{duiqj-i,qj) + duiqj, qj+i)). 

Adding together these kinds of inequalities completes the proof. 

In the special case where S is the identity. Lemma 14.4.111 is trivial. The other 
special case is when S = v. Here, we have du{po,Pi) = du{qQ,qi), so the path a is 
identical to the geodesic segment [po, ^Po]- Provided K2 > 1, there is nothing more 
to prove in this case. So if, later, we arrive at a value of K2 which is less than 1, 
we will take K2 = 1 instead. 

For all other 6 G S'L(2,Z), at least one of the rj must have the form u{nj), 
where rij is a non-zero integer. We now show that when rj = u{nj), there exists a 
constant K2 such that 

duiPj-i,Pj) < K2duiqj-i,qj). (4.14) 

We have 

duiPj-i,Pj) = dui0j^ipo,6jpo) = du{po,Oj\ejPo) = du{po,rjPo). 

Also, 

du{qj-i,qj) = du{6j\qj-i,0j\qj). 

Since in both cases we have applied the isometry Oj^^, we may now, without loss 
of generality, compare the distances between the points po and rjpo, and Oj^qj^i 
and Oj^^qj. By symmetry in the imaginary axis, we may in addition assume that 
rii > 1. 
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We are considering geodesies L which contain the image of [po, 5po\ under O^^i^ 
and which, since po & T and rjpo e u{nj)T, intersect the tiles T and u{nj)T. We 
may, as above, parametrise z&L hy z — a-\- re**^, where a e R, r > and 

< < TT. 

If j = 1, then the isometry 9j\ is just the identity. Thus, as Po = qo = 2i, the 
geodesic L passes through the point 2i in the tile T. Now, L either intersects the 
arc of the circle |z| = 1 between the points z = ^ + and z = + ^i, or L 
intersects the ray R given by 



1 \/3 
izeU^ : Re{z) = -- and lm{z) > 



The case where L intersects the arc may be dealt with as below, where is not 
the identity. We now suppose L intersects the ray R in the point w. Then, the point 
a, which is the centre of the semicircle L, is where the perpendicular bisector of the 
points w and 2i meets the real axis. The value of a thus has a maximum, when 
w = — I + ^i. It follows that there are only finitely many positive integers n\ such 
that Pi = ni + 2i. Let N be the maximum possible value of ni. By the reductions 
carried out above to form the product t\Ti - • -Tmi as the angle increases, the 
geodesic L must pass straight from a tile not of the form u{m\)T ^ where m\ e Z, 
into the tile u{n\)T . Now, for 1 < ni < A^, the smallest value of du{2i,qi) occurs 
when L passes through the vertex (ni — |) + of the tile u{ni)T. Since there 
are only finitely many cases to consider, we can thus find a K'2 sufficiently large so 
that, for each instance 1 <ni < N, 



du{Po,Pi) ^ du{2i,ni + 2i) < K2du{2i,qi) ^du{qo,qi)- 



For all J > 1, to simplify notation, write n — rij, p = Pq, p' = n+po, q 



and q' — Oj\qj. We first establish bounds on the values of a and r in terms of n. 
By the reductions carried out above to form the product rir2 ■ ■ - rm, as the angle 
(f) increases, the geodesic L must pass straight from a tile not of the form u{mi)T 
into u{n)T, and later on from T straight into a tile not of the form u{m2)T, where 
mi, 1712 G Z. 



P 




p 



n 
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Thus, the radius of L is bounded by the inequahties 



a — [n 



(n-l/2 + iV3/2^ <r< a - (n + 1/2 + iV3/2 



and 



- (^1/2 + iV3/2 



< r < 



-l/2 + iV3/2 



By squaring, expanding and combining fl4.15p and (14.161) . we get 

n — 1 n + 1 
<a< . 



(4.15) 
(4.16) 

(4.17) 



Then, using (I4.16P and (14.171) . we obtain 

< < 



- 4ra + 7 ^ 2 ^ + ^n + 7 



(4.18) 



Now, we find the hyperbohc distance between q and q' in terms of a, r and n. 
Using (14.131) . the closest point to p = 2i on the geodesic L is 



q = a + 



2ar^ A/(a2 + + 4)2 - 4a^r^ 



q2 _^ ^2 _^ 4 0,2 _^ ^2 _^ 4 

and the closest point to p' = n + 2i on the geodesic L is 



g = a + 



2(n-a)r2 Jiin - a)2 + r2 + 4)2 - 4(n - a)2r2 

_|_ ^^_v 



(ra — a)2 + r2 + 4 {n — a)2 + r2 + 4 

Then, the distance between q and g' is, for fixed n, 

F(a, r 



c^i/lg) g') = cosh ^ ^1 



(?(«, r) 



where F : IR2 ^ ]^ jg function 

F(a,r) = \q- qf 

= (Re(g) - Re(g'))^ + (Im(g) - Im(g')) 

.2 o„^2 \ 2 



2(n — a)r 



2ar' 

+ o . o . . I + 



'n-a)2 + r2 + 4 a2 + r2 + 4^ 
A/(a2 + r2 + 4)2 - 4a2r2 A/((n - a)2 + r2 + 4)2 - 4r2(n - a)2 



a2 + r2 + 4 



(n - a)2 + r2 + 4 



IS 



and G : M2 _ 

(^(a, r) = 2 Im(g) Im(g') 



2r- 



v/((a2 + r2 + 4)2 - 4a2r2)(((r2 - a)2 + r2 + 4)2 - 4r2(r2 - a)2) 
(a2 + r2 + 4)((n - a)2 + r2 + 4) ' 
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From the picture of the geodesic L above, it is clear that Im(g) may equal 
Im(g'). To show that F{a,r) is bounded below, we thus consider the difference 
between Re(g) and Re(g'). We then show that G{a, r) is bounded above. These facts 
together will prove that for all n > 1, and all a and r satisfying (14 .171) and (14.181) 
respectively, 

^ < (4 19) 

where > is a constant. This implies a lower bound on du{q, q'). 

From fl4.17p . we have a < (n+ l)/2, and n — a < (n + l)/2. Using these bounds 
and KW) . 

2{n — a)r'^ ^ 2ar^ ^ 2{n — a)r'^ + 2ar^ 



(n - a)2 + r2 + 4 + + 4 ~ (n + 1) V4 + (n^ + 4n + 7)/4 + 4 

2nr^ 



> 



n72 + 3n/2 + 6 
n{n^ - 4n + 7) 



n2 + 3n + 12 

The right-hand side of the final line goes to n as n — > oo, so 



^^^'""^ - V(n-a)2 + r2 + 4 ^ a2 + r2 + 4j "A;' 

for some positive constant k. For G, by a similar process using (14.171) and (14.181) 
repeatedly, we obtain 

The function of n on the right-hand side converges to a constant as n ^ oo, so 
G (a, r) is bounded above, by say Kj k. The lower bound on F{a, r) / G{a, r) at (14.191) 
follows. Therefore, the distance between the points q and q' is bounded below: 

We can find a constant K2 > such that for all n > 1, 

1 + y j < K'^cosh-^ [^^^ 

Therefore, du{p,p') < K2du{q,q')- We have established (I4.14p . 

We next consider the rj of the form rj = v, where j is strictly less than m. 
Rather than comparing the distances du{qj-i, qj) and du{pj-i,Pj), we show instead 
that 

duiPj-uPj) + duiPj,Pj+i) < 4^K'2 iduiqj-i,qj) + duiqj,qj+i)) ■ (4.20) 
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When rj = v, we have 

duiPj-uPj) = duiPo,rjPo) = du{2i,i/2). 

By the construction of the product rir2 ■ ■ ■ r^,, and the assumption that j < m, if 
rj = V then r^+i = u(nj+i) for some non-zero integer n^+i. So, by fl4.14p . we have 

du{Pj,Pj+i) < K'^du{qj,qj+i). 
Then, the following proves fl4.20p : 

du{Pj-uPj) + duiPj,Pj+i) = du{2i,i/2) + duiPj,Pj+i) 

= log4 + duiPj,Pj+i) 

< 3cosh~i(l + 1/8) + du{pj,Pj+i) 

< 3cosh"^(l + r2^+i/8) + du{pj,Pj+i) 
= 3du{po,Po + rij+i) + duiPj,Pj+i) 

= 3du{po, rj+ipo) + du{pj,Pj+i) 
= 3duiPj,Pj+i) + duiPj,Pj+i) 
= AduiPj,Pj+i) 

< AK'^du{qj,qj+i) 

< AK'2 {duiqj-i, qj) + du{qj, qj+i)) ■ 

The final case is when j = m and = v. Then Vm-i must be u{nm-i) for some 
non-zero integer rim-i- It can be proved, as for f l4.20p . that 

du{Pm-2,Pm-l) + du{Pm-l,Pm) < AK'^ {du {qm-2 , qm-l) + du{qm-l, qm)) ■ 

Let the constant K2 be K2 = 2 x AK!^. (Multiplication by 2 is needed to cater 
for the possibility of counting di/{qm-2,qm-i) twice.) Then, 

m m 

^du{pj-i,Pj) < K2^du{qj-i,qj), 

and so by (gS]) and fOT]) . length(a) < K2du{po, Spo). □ 

The next lemma shows that the length of the path a is also roughly the same 
as the sums of the values of / at each of the terms rj. 

Lemma 4.4.12. There exist constants K3, K4 > such that 

m 

ir3length(a) < ^/(r^) < 7^4 length (a). 
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Proof. If Tj = V, then du{po,rjPo) = log 4 and f{rj) = 1. If rj = u{nj) for rij 0, 
then du{po,rjPQ) = cosh~^ (l + ri^/S) and /(r^) = log(|nj| + 1). Let the set J be 

J = {jE'Ij:l<j<'m and rj = v} , 

and denote its cardinality by \J\. Then 

length(a) = ^rf[/(po, r^po) = l-^'l log4 + ^cosh^^ (l + , 

and 

TTl 

Now, for all rij 7^ 0, 

cosh-^(l + n^V8) < 31og(|nj | + 1) 

and, also, 

log(|nj| + 1) < 3coslr^(l + njV8)- 

It follows that 

- length(a;) < < 31ength(Q;). 



i=i 



□ 



We may now conclude the proof of Proposition 14.4. 101 By Lemmas 14.4.111 
and 14.4. 12| there exist constants Ki, K2, and such that 

m 

KsKiduipo.Spo) < ir3length(a) < ^/(r^) < i^4length(a) < K^K2duipo, 5po). 

□ 

Lemma 4.4.13. Let po be the point 2i in f/^. Then there is a constant K^^ > 
such that for any S G SL{2, Z), 

du{Po,Spo) < i^slogll^ll. 

Proof. Let 6 be in 5*1/(2, Z). As shown in Lemma [2.6.161 there exist matrices ki 
and k2 in S0{2, M), and a matrix a in SL(2, M) of the form j where s > 1, 
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such that 6 = kiak2. Then, by Lemma 13.5. ![ = ||a|| = s. Since the group 
5*0(2, M) stabihses i, we have du{i,Si) = du{i,ai) = du{i,s'^i) = 2 logs. Thus, 

duiPo, Spo) < dui^i, i) + du{i, Si) + du{6i, S{2i)) 
= 2du{'2i, i) + du{i, Si) 
= 21og2 + 21ogs 
= log4 + 21og| 

<^5log 



for some constant > 0. □ 

Corollary 4.4.14. There exists a constant Kq > such that, for all s and t where 
1 < s ^t <d, and all6 e SL''\2, Z) C SL{d, Z), 

dw{l,S)<Kelog\\6\\. 

Proof. Let rir2 ■ ■ ■ be a word for 6 as in Proposition 14.4.1^ where each vj belongs 
to S'L*'*(2,Z). Let S be a fixed finite set of generators of SL{d,1j), and let be 
the word distance function on SL{d,Z) induced by S. 

A n 

First, we consider the terms rj which are of the form ( ^ ^ 

integer Uj. Such elements may be written as vj = {Est{^))"'^. By Proposition I4.4.4t 
there exists a constant Cst such that 



dj:{l,rj)<Cstlog{\nj\ + l)=Cstf{r 



3/1 



where / is the function defined in the statement of Proposition I4.4.10[ Let C > 
be the maximum of the constants Cst over the finitely many values of s and t such 
that l<s^t<d. Then r^) < Cf{rj). 

If the element Vj is of the form i^j = \ ^ 1 ' ^^^^ '"i ^lay be written as 



^-1 0^ 

a shortest word of fixed length Lgt in our generators S. For such rj, we have 
f{rj) = 1. Let L > be the maximum of the constants Lgt over the finitely many 
values of s and t such that 1 < s t < d. Then ^2(1,''"^) < Lf{rj). 

Let M be the maximum of L and C. By Proposition 14.4.101 and Lemma 14.4. 13[ 
the word length of 5 with respect to S is bounded as follows: 

m m 

c?e(1,^) < X;^s(l,r,) < M J]/(r,) < Mcdu{po.5po) < Mc4i^5 log 

Thus there exists a constant Kq>Q such that dw{^,5) < f^e log □ 
4.-4-4 Bounds on the operator norm of a matrix 

Lemma 4.4.15. There exist constants C5, ce > such that, for any d x d real 
matrix A = [aij), 

C5max|aij| < \\A\\ < C6max|aij|. 
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Proof. Suppose the entry of A which has maximum absolute value occurs in column 
k. Then 



A\\ = sup ||y4x|| > \\Aek\\ = f ^ kifclM > (j^max\aik\J ^ 



max I ttij I 



For the other inequality, let x be any vector such that ||x|| = 1, and write y = Ax. 
For 1 < j < d, we have \xj\ < 1, so 

Px|| = ||y|| 

d \ 1/2 



2 



=1 

max \ y, 

i 

— Vdmax 



j=i 



d 



< \fdd max | aijXj \ 

< d^^^ max |ajj|. 

id 



Thus, 



\A\\ = sup 1|AX|| < rf3/2 

max I a 

||x||=l 



Note that we may take the constant C5 to be 1. □ 



4-4-5 Conclusion of the proof of Theorem 4-4-^ 



Let 7 be in SL{d, Z). By Corollary 14.4.31 7 may be written as a word 

7 = S1S2 ■ ■ - 5^2, 

where each Si G SU'^{'L) for some 1 < s 7^ t < d. Moreover, the entries of these 
5i are bounded by a fixed polynomial P in the entries of 7; we will write P(7) for 
-P(7ii, 712, • • • , 7dd)- Then for 1 < i < (i^, the (A;, /) entry of the matrix 5i satisfies 

< \pii)\- (4.21) 

Let 6 be the matrix in the expression S162 ■ ■ - ^dP which has maximum norm. That 
is, 5 = 5i for some 1 < < c?^, and > for all 1 < i < d'^. Then, by 
Corollary 14.4.141 the word distance from 1 to 7 is bounded as follows: 

c^w^(l,7) < J]t^iy(l,^i) < I]^6log||5i|| <K^\og 

i=l i=l 
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where Kj is some positive constant. Using Lemma [4.4. 15[ and fl4.2ip . and writing 
5 = ((^fc/), we then obtain 

K^\og\\5\\<Kj\og(^c,m^^\5u\^ < log {c,\P{-i)\) . 

Let the degree of the polynomial P be deg(P), and let the sum of the the absolute 
values of the coefficients of P be S. Then for all 7, 

deg{P) 

|P(7)| <^(max|7.,| 



so 



K,\og{C^\P{-i)\) < Kr\og C65(max|7,,|) 



,,hdeg(P) 
deg(P) 



< Kflog CqS max |7ij| 



K-r deg(P) log ( cqS max \-fij\ 



< iriog max|7jj| 



for some constant i^' > 0. Finally, using Lemma [4.4.151 again. 

Klog ^niax|7y|^ </riog||7||. 

Thus 

^^-(1,7) <iriog||7i|, 
and the proof of Theorem 14.4.11 is complete. 



97 



References 



[1] Alan F. Beardon. The Geometry of Discrete Groups. Springer- Verlag, 1983. 
[2] H. J. M. Bos and H. Mchrtcns. The interactions of mathematics and society 

in history: Some explanatory remarks. Historia Mathematica, 4:7-30, 1977. 
[3] Carl B. Beyer. A History of Mathematics. John Wiley & Sons, Inc., 1991. 
[4] John B. Conway. A Course in Functional Analysis. Springer- Verlag, 1990. 
[5] Jeremy Gray. Ideas of Space: Euclidean, Non-Euclidean and Relativistic. 

Clarendon Press, 1989. 
[6] Jeremy Gray. Geometry — formalisms and intuitions. In Jeremy Gray, edi- 
tor. The Symbolic Universe: Geometry and Physics 1890-1930, pages 58-83. 

Oxford University Press, 1999. 
[7] Jeremy Gray. Introduction. In Jeremy Gray, editor, The Symbolic Universe: 

Geometry and Physics 1890-1930, pages 1-21. Oxford University Press, 1999. 
[8] Sigurdur Helgason. Differential Geometry, Lie Groups, and Symmetric Spaces. 

Academic Press, 1978. 
[9] Luke Hodgkin. Mathematics and revolution from Lacroix to Cauchy. In Herbert 

Mehrtens, Hcnk Bos, and Ivo Schneider, editors. Social History of Nineteenth 

Gentury Mathematics, pages 50-71. Birkhauser, 1981. 
[10] James E. Humphreys. Arithmetic Groups. Springer- Verlag, 1980. 
[11] Morris Kline. Mathematical Thought from Ancient to Modern Times. Oxford 

University Press, 1972. 
[12] J. Larrieu. Principles of Linear Algebra. Gordon and Breach, 1972. 
[13] Alexander Lubotzky, Shahar Mozes, and M. S. Raghunathan. Cyclic subgroups 

of exponential growth and metrics on discrete groups. G. R. Acad. Sci. Paris, 

317:735-740, 1993. 

[14] Alexander Lubotzky, Shahar Mozes, and M. S. Raghunathan. The word and 
Riemannian metrics on lattices of semisimple groups. Inst. Hautes Etudes Sci. 
Publ. Math., 91:5-53, 2001. 

[15] Arthur Miller. Einstein, Poincare, and the testability of geometry. In Jeremy 
Gray, editor. The Symbolic Universe: Geometry and Physics 1890-1930, pages 
47-57. Oxford University Press, 1999. 

[16] John Milnor. Hyperbolic geometry: The first 150 years. Bulletin of the Amer- 
ican Mathematical Society, 6:9-24, 1982. 

[17] John D. Norton. Geometries in collision: Einstein, Klein and Riemann. In 
Jeremy Gray, editor. The Symbolic Universe: Geometry and Physics 1890- 
1930, pages 128-144. Oxford University Press, 1999. 

[18] Plato. Meno. Aris & Phillips Pubhshers, 1985. 

[19] Henri Poincare. Science and Hypothesis. Dover Publications Inc., 1952. 
[20] Proclus. A Gommentary on the First Book of Euclid's Elements. Princeton 
University Press, 1969. 



98 



[21] John G. Ratcliffc. Foundations of Hyperbolic Manifolds. Springer- Verlag, 1994. 
[22] B. A. Rosenfeld. A History of Non-Euclidean Geometry: Evolution of the 

Concept of a Geometric Space. Springer- Verlag, 1988. 
[23] John StiUweU. Introduction to Poincare's 'Theory of Fuchsian groups', 'Memoir 

on Kleinian groups', 'On the apphcations of non-Euchdean geometry to the 

theory of quadratic forms'. In John Stillwell, editor. Sources of Hyperbolic 

Geometry, pages 113-122. American Mathematical Society, 1996. 
[24] Roberto Torretti. Philosophy of Geometry from Riemann to Poincare. D. 

Reidel Publishing Company, 1984. 
[25] Scott Walter. The non-Euclidean style of Minkowskian relativity. In Jeremy 

Gray, editor. The Symbolic Universe: Geometry and Physics 1890-1930, pages 

91-127. Oxford University Press, 1999. 



99 



